Improvising Data Locality and Availability in Hbase Ecosystem

Journal Title: IOSR Journals (IOSR Journal of Computer Engineering) - Year 2014, Vol 16, Issue 2

Abstract

 Abstract: In this paper, we try to represent the importance of data locality with the HBase architecture. HBase has a dynamic master slave architecture but the emphasis on data locality, i.e. getting the logic or processing near to the data is the major phenomenon followed for better and efficient performance. Data Locality is valid as every region server has the information of every data blocks located in respective regions but what if the region server crashes or the region server is restarted or the regions are randomly re-distributed with all the region servers due to load balancing, then data locality is completely lost during that time. Performance is majorly affected if there is misconfiguration of data locality in the cluster. The HMaster uses [4] .META table to get information about the region server that has its specified regions containing rows. Keeping an eye on this disadvantages and challenges, we propose to improvise data locality by allocating maximum regions to that region server which had the maximum data blocks of that region in it. An algorithm is proposed based on HRegion locality index for deciding the criteria of allocating the regions to region servers for maintaining data locality.

Authors and Affiliations

Shalini Sharma , Satyajit Padhy

Keywords

Related Articles

 Fast Remote data access for control of TCP/IP network using android Mobile device

 Abstract: In today’s world most of the mobile have the use more than its basic functionality. As mobile becomes more advance to be have same architecture same as desktop system. Hence this feature should be used as...

 Database Applications in Analyzing Agents

 Abstract: There are many situations in which two or more agents (e.g., human or computer decision makers)interact with each other repeatedly in settings that can be modeled as repeated stochastic games. In suchsitu...

 Protection of Direct and Indirect Discrimination using Prevention  Methods

 Along with privacy, discrimination is a very important issue when considering the legal and ethical aspects of data mining. It is more than observable that the majority people do not want to be discriminated &nbs...

An Effective m-Health System for Antenatal and Postnatal Care in Rural Areas of Bangladesh

Abstract: In South Asia, Maternal Mortality Rate (MMR) is so high due to the lack of health facility, doctor’s insufficiency, lack of communication facility and also the poverty. Bangladesh is also suffering from this un...

 Fidelity Analysis of Additive and Multiplicative Watermarked Images in Integrated Domain

Abstract: The escalation of internet has increased the usage of multimedia contents for wide range of functions. The easy access of the digital contents paves way to manipulate, edit and duplicate the contents using av...

Download PDF file
  • EP ID EP121399
  • DOI 10.9790/0661-162113641
  • Views 116
  • Downloads 0

How To Cite

Shalini Sharma, Satyajit Padhy (2014).  Improvising Data Locality and Availability in Hbase Ecosystem. IOSR Journals (IOSR Journal of Computer Engineering), 16(2), 36-41. https://europub.co.uk/articles/-A-121399