MapReduce Performance in MongoDB Sharded Collections

Abstract

In the modern era of computing, with countless online services gathering and serving huge volumes of data around the world, processing and analyzing Big Data has rapidly developed into a field of its own. In this paper, we focus on the MapReduce programming model and its associated implementation for processing and analyzing large datasets in a NoSQL database such as MongoDB. Furthermore, we analyze the performance of MapReduce on sharded collections holding huge datasets, and we measure how the execution time scales as the number of shards increases. Based on the results, we try to explain when MapReduce is an appropriate processing technique in MongoDB, and we suggest some measures to take and alternatives to consider when MapReduce is used.

Authors and Affiliations

Jaumin Ajdari, Brilant Kasami

  • EP ID EP320052
  • DOI 10.14569/IJACSA.2018.090617

How To Cite

Jaumin Ajdari, Brilant Kasami (2018). MapReduce Performance in MongoDB Sharded Collections. International Journal of Advanced Computer Science & Applications, 9(6), 115-120. https://europub.co.uk/articles/-A-320052