HDFS: Erasure-Coded Information Repository System for Hadoop Clusters

Abstract

Existing disk based recorded stockpiling frameworks are insufficient for Hadoop groups because of the obliviousness of information copies and the guide decrease programming model. To handle this issue, a deletion coded information chronicled framework called HD-FS is developed for Hadoop bunches, where codes are utilized to file information copies in the Hadoop dispersed document framework or HD-FS. Here there are two chronicled systems that HDFS-Grouping and HDFS-Pipeline in HDFS to accelerate the information documented process. HDFS-Grouping is a Map Reduce-based information chronicling plan - keeps every mapper's moderate yield Key-Value matches in a nearby key-esteem store and unions all the transitional key-esteem sets with a similar key into one single key-esteem combine, trailed by rearranging the single Key-Value match to reducers to create last equality squares. HDFS-Pipeline frames an information recorded pipeline utilizing numerous information hub in a Hadoop group. HDFS-Pipeline conveys the consolidated single key-esteem combine to an ensuing hub's nearby key-esteem store. Last hub in the pipeline is mindful to yield equality squares. HD-FS is executed in a true Hadoop group. The exploratory outcomes demonstrate that HDFS-Grouping and HDFS-Pipeline accelerate Baseline's rearrange and diminish stages by a factor of 10 and 5, individually. At the point when square size is bigger than 32 M-B, HD-FS enhances the execution of HDFS-RA-ID and HDFS-EC by roughly 31.8 and 15.7 percent, separately. Ameena Anjum | Prof. Shivleela Patil"HDFS: Erasure-Coded Information Repository System for Hadoop Clusters" Published in International Journal of Trend in Scientific Research and Development (ijtsrd), ISSN: 2456-6470, Volume-2 | Issue-5 , August 2018, URL: http://www.ijtsrd.com/papers/ijtsrd18206.pdf http://www.ijtsrd.com/computer-science/other/18206/hdfs-erasure-coded-information-repository-system-for-hadoop-clusters/ameena-anjum

Authors and Affiliations

Keywords

Related Articles

Potential of Neem Leaf Powder as Bio Adsorbents for Dye Colour Removal

In this study, two types of eco friendly and low cost bio adsorbents, Neem leaf powder NLP and acid treated Neem leaf powder TNLP were prepared for the removal of dye color from Congo red solution. The physicochemical pa...

Efficient Design of 2 1 MUX Multiplexer using Nanotechnology Based on QCA

Quantum Dot Cellular Automata is a new technology which overcomes of the of CMOS limitations. It is an novel advanced nano-technology that revolves around the single-electron position control. It is one of the most effic...

Endothelial Nitric Oxide Synthase T786C Gene Polymorphism TT Genotype is Associated with High Nitric Oxide Levels and Low HOMA IR Levels in Coronary Artery Disease Patients

BACKGROUND Data suggest eNOS 786T C gene polymorphism is distinct in specific population group, ethnicity and geographic region and perhaps this genetic variability might produce different results on exposure to various...

Consumer Data Management

This paper explores the Consumer Data Management, Consumer Data Management CDM area as the process and framework for collecting, managing, and analyzing consumer data from various sources in order to form a unified view...

The Comparison of Statistical Quality Control Results on Reinforced Concrete Buildings in Myanmar

Yangon, the former capital city of Myanmar, gradually increases high rise buildings. But, Myanmar, our country, is one of the countries which still weakened at the quality management system of construction projects. Few...

Download PDF file
  • EP ID EP389959
  • DOI -
  • Views 74
  • Downloads 0

How To Cite

(2018). HDFS: Erasure-Coded Information Repository System for Hadoop Clusters. International Journal of Trend in Scientific Research and Development, 2(5), 1957-1960. https://europub.co.uk/articles/-A-389959