MapReduce Programs Simplification using a Query Criteria API

Abstract

The Hadoop Distributed File System (HDFS) is an organized, distributed collection of files. It is designed to store very large volumes of data and to allow that data to be retrieved and analyzed efficiently. To retrieve and analyze data stored in HDFS, MapReduce jobs must be written either directly, in a programming language such as Java, or indirectly, in a high-level language such as HiveQL or Pig Latin. Writing MapReduce programs in a general-purpose programming language is a difficult task that demands considerable effort, both to create the programs and to maintain them. Hand-written MapReduce code takes a long time to produce, introduces bugs, harms readability, and impedes optimization. Practitioners working in the field of big data therefore try to avoid long, complex programs and look for simpler alternatives such as graphical interfaces, compact scripts such as Pig Latin, or SQL queries. This article proposes a MapReduce Query API, inspired by Hibernate Criteria, to simplify the code of MapReduce programs. The API provides a set of predefined methods for expressing restrictions, projections, logical conditions, and so on. An implementation of the Word Count example using the Query Criteria API is presented in this paper.
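The abstract does not show the API itself, so the following is only a minimal sketch of what a criteria-style Word Count could look like. It runs in memory on plain Java collections rather than on Hadoop, and every name in it (`CriteriaSketch`, `WordQuery`, `add`, `project`, `countByKey`) is a hypothetical illustration, not the authors' API. Restrictions play the role of filters applied in the map phase, the projection chooses the emitted key, and the grouping step stands in for the reduce phase.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;
import java.util.function.Function;
import java.util.function.Predicate;
import java.util.stream.Collectors;

public class CriteriaSketch {

    /** Hypothetical criteria-style query over words; an assumption, not the paper's API. */
    static final class WordQuery {
        private final List<Predicate<String>> restrictions = new ArrayList<>();
        private Function<String, String> projection = Function.identity();

        /** Adds a restriction; a word is kept only if every restriction accepts it. */
        WordQuery add(Predicate<String> restriction) {
            restrictions.add(restriction);
            return this;
        }

        /** Sets the projection that turns a word into the key to be counted. */
        WordQuery project(Function<String, String> keyProjection) {
            this.projection = keyProjection;
            return this;
        }

        /** Splits lines into words, filters and projects them, then counts per key. */
        Map<String, Long> countByKey(List<String> lines) {
            return lines.stream()
                    // "map" side: tokenize each line on whitespace
                    .flatMap(line -> Arrays.stream(line.split("\\s+")))
                    .filter(word -> !word.isEmpty())
                    .filter(word -> restrictions.stream().allMatch(r -> r.test(word)))
                    .map(projection)
                    // "reduce" side: group identical keys and count them
                    .collect(Collectors.groupingBy(
                            Function.identity(), TreeMap::new, Collectors.counting()));
        }
    }

    public static void main(String[] args) {
        List<String> lines = List.of("Hadoop stores data", "hadoop is big");

        Map<String, Long> counts = new WordQuery()
                .add(word -> word.length() > 2)   // restriction: skip very short words
                .project(String::toLowerCase)     // projection: case-insensitive key
                .countByKey(lines);

        counts.forEach((word, n) -> System.out.println(word + "\t" + n));
    }
}
```

The point of the sketch is the calling code in `main`: the query is stated as a chain of declarative conditions, and the tokenizing, filtering, and aggregation logic that would otherwise be written by hand in a mapper and a reducer stays hidden behind the query object.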

Authors and Affiliations

Boulchahoub Hassan, Khalil Namir, Amina Rachiq, Labriji Elhoussin, Benabbou Fouzia

  • EP ID EP319654
  • DOI 10.14569/IJACSA.2018.090607

How To Cite

Boulchahoub Hassan, Khalil Namir, Amina Rachiq, Labriji Elhoussin, Benabbou Fouzia (2018). MapReduce Programs Simplification using a Query Criteria API. International Journal of Advanced Computer Science & Applications, 9(6), 50-54. https://europub.co.uk/articles/-A-319654