Study of Apache Hadoop

Abstract

 Apache Hadoop is an open-source software framework for distributed storage and distributed processing of Big Data on clusters of commodity hardware. The settings of the Hadoop environment are critical for deriving the full benefit from the rest of the hardware and software. The distribution of Apache Hadoop software includes Apache Hadoop and other software components optimized to take advantage of hardware-enhanced performance and security capabilities. The Apache Hadoop project defines the Hadoop Distributed File System (HDFS) as "the primary storage system used by Hadoop applications", one that enables reliable, extremely rapid computations. HDFS splits files into large blocks (by default 64 MB or 128 MB, depending on the Hadoop version) and distributes the blocks among the nodes in the cluster. It is a distributed, user-level filesystem that takes care of storing data and can handle very large amounts of it.
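
The block splitting described in the abstract can be illustrated with a minimal Python sketch. It is not Hadoop code and does not call any Hadoop API: the names `split_into_blocks` and `assign_round_robin` are hypothetical, and the round-robin placement is a deliberate simplification, since in a real cluster the NameNode decides block placement and also replicates each block.

```python
MIB = 1024 * 1024  # bytes in one mebibyte


def split_into_blocks(file_size, block_size=128 * MIB):
    """Return (offset, length) pairs covering a file of file_size bytes.

    Every block is block_size bytes long except possibly the last one,
    which holds whatever remains.
    """
    if file_size < 0 or block_size <= 0:
        raise ValueError("file_size must be >= 0 and block_size must be > 0")
    blocks = []
    offset = 0
    while offset < file_size:
        length = min(block_size, file_size - offset)
        blocks.append((offset, length))
        offset += length
    return blocks


def assign_round_robin(blocks, nodes):
    """Spread block indices over nodes in turn.

    Assumption for illustration only: real HDFS placement is decided by
    the NameNode and takes replication and rack layout into account.
    """
    if not nodes:
        raise ValueError("at least one node is required")
    placement = {node: [] for node in nodes}
    for index in range(len(blocks)):
        placement[nodes[index % len(nodes)]].append(index)
    return placement


if __name__ == "__main__":
    blocks = split_into_blocks(300 * MIB)
    print(len(blocks), "blocks")
    print(assign_round_robin(blocks, ["node-a", "node-b", "node-c"]))
```

Under these assumptions a 300 MiB file yields three blocks with a 128 MiB block size and five blocks with a 64 MiB block size; in both cases the last block is only 44 MiB long.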

How To Cite

(2014).  Study of Apache Hadoop. International Journal of Engineering Sciences & Research Technology, 3(12), 270-275. https://europub.co.uk/articles/-A-137378