Analyzing Performance of Map Reduce, Pig Latin and Hive on Windows Platform
Journal Title: International Journal of Engineering Research and Applications - Year 2017, Vol 7, Issue 9
Abstract
The Hadoop framework enables efficient distributed processing of large data sets across clusters of commodity computers. MapReduce, the core programming model of the Hadoop ecosystem, processes data stored in the Hadoop Distributed File System (HDFS). However, MapReduce is difficult for non-programmers to work with. Hadoop therefore supports Hive, whose SQL-like HiveQL statements are implicitly and immediately translated into one or more MapReduce jobs. To help procedural-language developers, Hadoop also supports the Pig Latin language. This paper runs a text data processing application with MapReduce, Hive and Pig on a single-node Windows platform and compares their performance graphically.
Authors and Affiliations
Mr. Manishkumar R Solanki, Yashvi Shah, Siddhi Shukla, Shrusti Talati