Association Rule Mining in Big Data using MapReduce approach in Hadoop

Journal Title: GRD Journal for Engineering - Year 2016, Vol 1, Issue 0

Abstract

The concept of Association rule mining is an important task in data mining. In case of big data the large volume of data makes is impossible to generate rules at a faster pace. By making use of parallel execution in Hadoop using the MapReduce framework, the rules can be generated much faster and in an efficient way. The existing method transforms the input dataset into binomial representation before processing them using MapReduce. But binomial conversion is not user-friendly since it is complex in case of continuous values. In this paper, an improved and scalable algorithm is proposed for association rule mining that will convert the input dataset into key-value pairs instead of binomial. All the stages of proposed association rule mining algorithm are parallelized using MapReduce. The proposed algorithm works on high cardinality features and so no dimension detection is needed.

Authors and Affiliations

J. Jenifer Nancy, M. Jansi Rani, Dr. D. Devaraj

Keywords

Related Articles

Intelligent Pothole Repair Vehicle

Identifying and repairing potholes on the roads is labour intensive and expensive. It typically requires three or four people to do the monotonous job in difficult environments. So this is a great opportunity for a type...

Vehicle Security System using Embedded and GSM Technology

This paper deals with design and development of the theft control system for an automobile, which is being used to prevent or control the theft. The developed system makes use of an embedded system based on GSM technolog...

Dynamic Value Stream Mapping in the Realm of Green Manufacturing

Value stream mapping is a lean tool which is used in lean manufacturing to find the flow of information and material, as a product it makes way through value stream. These envision tool helps to understand processes by u...

Security and Privacy Enhancing in Multi-Cloud Architecture with Data De-duplication

Cloud computing makes IT more efficient and cost effective in today’s world. Cloud computing act as a virtual server that the user can access via internet on a needed basis and this eliminates the need for the companies...

Fiber Reinforced Polymer : A Smarter Material for the Smarter Constructions

Civil Engineering has a very important role in practical life because it creates a safe foundation & provides facility for smooth conduction of life. For that purpose, a material named Fiber Reinforced Polymer (FRP) is i...

Download PDF file
  • EP ID EP303006
  • DOI -
  • Views 77
  • Downloads 0

How To Cite

J. Jenifer Nancy, M. Jansi Rani, Dr. D. Devaraj (2016). Association Rule Mining in Big Data using MapReduce approach in Hadoop. GRD Journal for Engineering, 1(0), 179-186. https://europub.co.uk/articles/-A-303006