Improvising Data Locality and Availability in Hbase Ecosystem

Journal Title: IOSR Journals (IOSR Journal of Computer Engineering) - Year 2014, Vol 16, Issue 2

Abstract

 Abstract: In this paper, we try to represent the importance of data locality with the HBase architecture. HBase has a dynamic master slave architecture but the emphasis on data locality, i.e. getting the logic or processing near to the data is the major phenomenon followed for better and efficient performance. Data Locality is valid as every region server has the information of every data blocks located in respective regions but what if the region server crashes or the region server is restarted or the regions are randomly re-distributed with all the region servers due to load balancing, then data locality is completely lost during that time. Performance is majorly affected if there is misconfiguration of data locality in the cluster. The HMaster uses [4] .META table to get information about the region server that has its specified regions containing rows. Keeping an eye on this disadvantages and challenges, we propose to improvise data locality by allocating maximum regions to that region server which had the maximum data blocks of that region in it. An algorithm is proposed based on HRegion locality index for deciding the criteria of allocating the regions to region servers for maintaining data locality.

Authors and Affiliations

Shalini Sharma , Satyajit Padhy

Keywords

Related Articles

 Recapitulating the development initiatives of a robust information  security safeguard: RITSB-the proposed solution

 Most current information security systems performance vary with the nature of the filed its being operating. With an increased emphasizes on the adoption of security tools and technologies, the anomalies and &nb...

 Flexible Dynamic Recommender System

 A Recommender System now becoming decision maker for the people who lack sufficient personal experience to evaluate the items that are on website. It provides recommendation for specific items such as books, news,...

 K Means Clustering Algorithm for Partitioning Data Sets Evaluated From Horizontal Aggregations

 Data mining refers to the process of analyzing the data from different perspectives and summarizing it into useful information that is mostly used by the different users for analyzing the data as well as for p...

 A Reconfigurable Model In Wireless Sensor Network for saving wild life

 Abstract: The paper presents the design and evaluation of Wireless Sensor Network for prompt detection of forest fires. At first it presents the vital aspects in sculpting forest fires. It is being ensured by s...

 Qualitative Study on the efficiency of Load balancing algorithmsin Cloud Environment

 Abstract: Load balancing in cloud is different from the typical architecture of load balancing techniques. Thisopens up new opportunities and challenges. Resource management is the use of available processors in th...

Download PDF file
  • EP ID EP121399
  • DOI 10.9790/0661-162113641
  • Views 115
  • Downloads 0

How To Cite

Shalini Sharma, Satyajit Padhy (2014).  Improvising Data Locality and Availability in Hbase Ecosystem. IOSR Journals (IOSR Journal of Computer Engineering), 16(2), 36-41. https://europub.co.uk/articles/-A-121399