Analyzing Performance of Map Reduce, Pig Latin and Hive on Windows Platform

Abstract

The Hadoop framework allows distributed processing of large data sets across clusters of commodity computers efficiently. MapReduce, the core programming language of the Hadoop Ecosystem processes the data stored in Hadoop Distributed File System (HDFS). It is difficult for non programmers to work with MapReduce. Hadoop supports HiveQL (SQL like statements) which implicitly and immediately translates the queries into one or more MapReduce jobs. To help procedural language developers, Hadoop supports Pig Latin language. This paper runs a text data processing application with MapReduce, Hive and Pig on single node windows platform and compares performance in graphical form.

Authors and Affiliations

Mr. Manishkumar R Solanki, Yashvi Shah, Siddhi Shukla, Shrusti Talati

Keywords

Related Articles

Achieving Load Balancing through Program Slicing

Implementing load balance in parallel program is very important. It may reduce running time and improve performance of program. This paper proposes a slicing algorithm in which we did not use any slicing criteria but we...

Optimization of Minimum Quantity Lubricant (MQL) Conditions in Milling of mild Steel

Minimum quantity lubrication (MQL) has been well established as an alternative to flood coolant processing. The optimization of MQL conditions is reducing the machining cost and improving the performance. In this study,...

Verifying Result Correctness of Outsourced Frequent Itemset in Data Mining through Probabilistic and deterministic approaches

Cloud computing technology has enabled large organization to outsource data to a third-party service provider (server) for data mining and has provided a natural solution for the data-mining paradigms. However, outsourci...

Reuse & Recirculation of Filter Backwash Water of Water Treatment Water

Most of the water treatment plant, filtration is done by means of sand filtration process. Due to continuous filtration process, sand pores get clogged and decreases the efficiency. For mitigating such problem, reverse f...

Prediction Of Fluidity Parameter In Thixoforming Process For Aluminum Alloy Using Fuzzy Logic Approach

Thixoforming or semisolid metal processing is an upcoming technology to obtain near net shaped components. The material is processed in between the solidus and liquidus temperatures. This process is also known as thixofo...

Download PDF file
  • EP ID EP392381
  • DOI 10.9790/9622-0709043235.
  • Views 83
  • Downloads 0

How To Cite

Mr. Manishkumar R Solanki, Yashvi Shah, Siddhi Shukla, Shrusti Talati (2017). Analyzing Performance of Map Reduce, Pig Latin and Hive on Windows Platform. International Journal of engineering Research and Applications, 7(9), 32-35. https://europub.co.uk/articles/-A-392381