High Performance CDR Processing with MapReduce
Journal Title: Journal of ICT Research and Applications - Year 2016, Vol 10, Issue 2
Abstract
A call detail record (CDR) is a data record produced by telecommunication equipment consisting of call detail transaction logs. It contains valuable information for many purposes in several domains, such as billing, fraud detection and analytical purposes. However, in the real world these needs face a big data challenge. Billions of CDRs are generated every day and the processing systems are expected to deliver results in a timely manner. The capacity of our current production system is not enough to meet these needs. Therefore a better performing system based on MapReduce and running on Hadoop cluster was designed and implemented. This paper presents an analysis of the previous system and the design and implementation of the new system, called MS2. In this paper also empirical evidence is provided to demonstrate the efficiency and linearity of MS2. Tests have shown that MS2 reduces overhead by 44% and speeds up performance nearly twice compared to the previous system. From benchmarking with several related technologies in large-scale data processing, MS2 was also shown to perform better in the case of CDR batch processing. When it runs on a cluster consisting of eight CPU cores and two conventional disks, MS2 is able to process 67,000 CDRs/second.
Authors and Affiliations
Mulya Agung, Imam Kistijantoro
Parallel Technique for Medicinal Plant Identification System using Fuzzy Local Binary Pattern
As biological image databases are growing rapidly, automated species identification based on digital data becomes of great interest for accelerating biodiversity assessment, research and monitoring. This research applied...
Design of Triple-Band Bandpass Filter Using Cascade Tri-Section Stepped Impedance Resonators
In this research, a triple-band bandpass filter (BPF) using a cascade tri section step impedance resonator (TSSIR), which can be operated at 900 MHz, 1,800 MHz, and 2,600 MHz simultaneously, was designed, fabricated and...
Document Grouping by Using Meronyms and Type-2 Fuzzy Association Rule Mining
The growth of the number of textual documents in the digital world, especially on the World Wide Web, is incredibly fast. This causes an accumulation of information, so we need efficient organization to manage textual do...
Adjusting Time of Flight in Ultrasound B-mode Imaging for Accurate Measurement of Fat using Image Segmentation Technique
This research attempted to measure chicken intramuscular fat content using improved ultrasound B-mode images and image segmentation. Adapted B-mode imaging is proposed to increase the positioning accuracy of B-mode image...
Mining High Utility Itemsets with Regular Occurrence
High utility itemset mining (HUIM) plays an important role in the data mining community and in a wide range of applications. For example, in retail business it is used for finding sets of sold products that give high pro...