A Review: Hadoop Storage and Clustering Algorithms
Journal Title: IOSR Journals (IOSR Journal of Computer Engineering) - Year 2016, Vol 18, Issue 1
Abstract
In the last few years there has been a voluminous increase in the storing and processing of data, which demands both high processing speed and large storage space. Big data is defined as large, diverse and complex data sets that raise issues of storage, analysis and visualization when processing results. Four characteristics of big data (Volume, Value, Variety and Velocity) make it difficult for traditional systems to process. Apache Hadoop is a promising software framework for developing applications that process huge amounts of data in parallel on large clusters of commodity hardware in a fault-tolerant and reliable manner. Various performance metrics such as reliability, fault tolerance, accuracy, confidentiality and security improve with the use of Hadoop. Hadoop MapReduce is an effective computation model for processing large data on distributed data clusters such as clouds. We first introduce the general idea of big data and then review related technologies, such as cloud computing and Hadoop. Various clustering techniques are also analyzed based on parameters such as number of clusters, size of clusters, type of dataset and noise.
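As a rough illustration of the MapReduce computation model mentioned in the abstract, the following is a minimal single-process sketch of a word count in Python. It is not Hadoop code and is not taken from the paper; the function names (`map_phase`, `shuffle`, `reduce_phase`, `run_job`) and the sample input are assumptions chosen for illustration. It only mimics the three conceptual stages (map, shuffle/group by key, reduce) that a framework such as Hadoop would distribute across a cluster.

```python
from collections import defaultdict
from itertools import chain


def map_phase(record):
    """Map step: emit a (word, 1) pair for every word in one input record."""
    return [(word.lower(), 1) for word in record.split()]


def shuffle(pairs):
    """Shuffle step: group all emitted values by key.

    In a real cluster this grouping happens across machines; here it is
    a plain in-memory dictionary (an illustrative simplification).
    """
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce step: combine all values for one key into a single count."""
    return key, sum(values)


def run_job(records):
    """Run map, shuffle and reduce in sequence over a list of text records."""
    mapped = chain.from_iterable(map_phase(r) for r in records)
    grouped = shuffle(mapped)
    return dict(reduce_phase(k, v) for k, v in grouped.items())


if __name__ == "__main__":
    # Hypothetical input splits; in Hadoop these would be blocks of a file in HDFS.
    splits = ["big data needs big storage", "hadoop stores big data"]
    print(run_job(splits))
```

Because each call to `map_phase` depends only on its own record, and each call to `reduce_phase` only on its own key, both stages can in principle run in parallel on separate nodes, which is the property the model relies on.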
Authors and Affiliations
Latika Kakkar, Gaurav Mehta