Study of Apache Hadoop

Abstract

 Apache Hadoop is an open-source software framework for distributed storage and distributed processing of Big Data on clusters of commodity hardware. . The settings for the Hadoop environment are critical for deriving the full benefit from the rest of the hardware and software. The Distribution for Apache Hadoop* software includes Apache Hadoop* and other software components optimized to take advantage of hardware-enhanced performance and security capabilities.The Apache Hadoop project defines HDFS as “the primary storage system used by Hadoop applications” that enables reliable ,extremely rapid computations. Its Hadoop Distributed File System (HDFS) splits files into large blocks (default 64MB or 128MB) and distributes the blocks amongst the nodes in the cluster. Hadoop uses a distributed user-level filesystem. It takes care of storing data -- and it can handle very large amount of data.

Authors and Affiliations

Keywords

Related Articles

 NONLINERA COUPLING ANALYSIS FOR HYPERSONIC FLIGHT VEHICLE

 Based on 3D figuration, the nonlinear coupling between the airframe and scramjet for hypersonic vehicle is investigated in this paper. Firstly, a free-free beam structure elastic model is utilized taking into acco...

 VHTD Mechanism for Secure Data Transfer in Manets

 In MANETs, nodes move freely and so the topology of the nodes is highly dynamic. The process of routing the data packets to the destination is a challenging task. Most existing ad hoc routing protocols are propose...

 Testing of Randomness of the Number Generated by Fisher and Yates

 Proper randomness of the numbers generated by Fisher and Yates has been examined by Chakrabarty in2010 by computing the probability of occurrence of each digit in the table of generated numbers and then applying t...

 PARAMETRIC OPTIMIZATION FOR IMPROVING THE PERFORMANCE OF SINGLE SLOPE SOLAR STILL THROUGH EXPERIMENTAL STUDIES

 An attempt is made to improve the performance of single slope solar still for the production of fresh water in this study. The prime aim is to experimentally investigate the influence of control factors namely ston...

 A DATA GUIDED LEXI-SERACH APPROACH FOR TIME DEPENDENT TRAVELLING SALEMSMAN PROBLEM

 A simple lexi-search algorithm that uses path representation method for the time dependent traveling salesman problem TDTSP is proposed, to obtain exact optimal solution to the problem. Then a data-guided lexi se...

Download PDF file
  • EP ID EP137378
  • DOI -
  • Views 67
  • Downloads 0

How To Cite

(2014).  Study of Apache Hadoop. International Journal of Engineering Sciences & Research Technology, 3(12), 270-275. https://europub.co.uk/articles/-A-137378