HBA: Distributed Metadata Management for Large Cluster-Based Storage Systems

Abstract

An efficient and distributed scheme for file mapping or file lookup is critical in decentralizing metadata management within a group of metadata servers. This paper presents a novel technique called Hierarchical Bloom Filter Arrays (HBA) to map filenames to the metadata servers holding their metadata. Two levels of probabilistic arrays, namely, the Bloom filter arrays with different levels of accuracies, are used on each metadata server. One array, with lower accuracy and representing the distribution of the entire metadata, trades accuracy for significantly reduced memory overhead, whereas the other array, with higher accuracy, caches partial distribution information and exploits the temporal locality of file access patterns. Both arrays are replicated to all metadata servers to support fast local lookups. We evaluate HBA through extensive trace-driven simulations and implementation in Linux. Simulation results show our HBA design to be highly effective and efficient in improving the performance and scalability of file systems in clusters with 1,000 to 10,000 nodes (or super clusters) and with the amount of data in the peta byte scale or higher. Our implementation indicates that HBA can reduce the metadata operation time of a single-metadata-server architecture by a factor of up to 43.9 when the system is configured with 16 Meta data servers. M S Nirmala"HBA: Distributed Metadata Management for Large Cluster-Based Storage Systems" Published in International Journal of Trend in Scientific Research and Development (ijtsrd), ISSN: 2456-6470, Volume-2 | Issue-5 , August 2018, URL: http://www.ijtsrd.com/papers/ijtsrd18211.pdf http://www.ijtsrd.com/engineering/electronics-and-communication-engineering/18211/hba-distributed-metadata-management-for-large-cluster-based-storage-systems/m-s-nirmala

Authors and Affiliations

Keywords

Related Articles

Design Optimization of Reinforced Concrete Slabs Using Various Optimization Techniques

This paper presents Reinforced Concrete RC slab design optimization technique for finding the best design parameters that satisfy the project requirements both in terms of strength and serviceability criteria while keepi...

A Survey on Classification and Prediction Techniques in Data Mining for Diabetes Mellitus

The medical industry incredibly utilizes the data mining systems for different expectations and characterization. The substantial data repositories produced is subjected to different calculations to distinguish the examp...

A Study on Geopolymer with Dyeing Industry Effluent Treatment Plant Sludge

In this paper, it is envisaged to project a new composite material, which can be made from the existing non-degradable and hazardous waste materials. The composite material is a combination of Fly ash Geopolymer FAG and...

A Study to Assess the Effectiveness of Structured Teaching Programme on Knowledge and Skill Regarding Management of Patients Admitted in Hospital Triage Setting

Introduction Most patients with life threatening or potential life threatening problems arrive at the hospital through the emergency department. Many more patients report to the emergency department for less urgent condi...

Design of an Air Conditioning System for a Commercial Building using Air Handling Unit

An Air Handling Unit is a central air conditioner station that handles the air that usually, will be supplied into the buildings by the ventilation ductwork connecting to it. Handling the air means that the air will be d...

Download PDF file
  • EP ID EP389971
  • DOI -
  • Views 59
  • Downloads 0

How To Cite

(2018). HBA: Distributed Metadata Management for Large Cluster-Based Storage Systems. International Journal of Trend in Scientific Research and Development, 2(5), 1966-1971. https://europub.co.uk/articles/-A-389971