Frequent Itemset Mining Technique in Data Mining  

Abstract

In computer science and data mining, Apriori is a classic algorithm for learning association rules. Apriori is designed to operate on databases containing transactions (for example, collections of items bought by customers, or details of a website frequentation). Frequent itemsets play an essential role in many data mining tasks that try to find interesting patterns from databases, such as association rules, correlations, sequences, episodes, classifiers, clusters and many more of which the mining of association rules is one of the most popular problems. In this paper, we take the classic Apriori algorithm, and improve it quite significantly by introducing what we call a vertical sort. We then use the large dataset, web documents to contrast our performance against several state-of-the-art implementations and demonstrate not only equal efficiency with lower memory usage at all support thresholds, but also the ability to mine support thresholds as yet un-attempted in literature. We also indicate how we believe this work can be extended to achieve yet more impressive results. We have demonstrated that our implementation produces the same results with the same performance as the best of the state-of-the art implementations. In particular, we have started with the classic algorithm for this problem and introduced a conceptually simple idea, sorting the consequences of which have permitted us to outperform all of the available state-of-the-art implementations. 

Authors and Affiliations

Sanjaydeep Singh Lodhi , Premnarayan Arya , Dilip Vishwakarma

Keywords

Related Articles

Implementation of Matched Filter Based DSSS Digital GPS Receiver  

The Global Positioning System (GPS) is a satellite-based radio navigation system made up of a network of 24 satellites placed in an orbit by the US Department of Defense. GPS was originally intended for military applicat...

A Survey on Recent Trends in Cloud Computing and its Application for Multimedia 

Cloud computing has been the emerging technology in the recent years and computing has shifted it base to the clouds taking the world of computing to cloud computing. This paper surveys some of the recent technol...

A survey on AODV routing protocol for AD-HOC Network

Now a day, Ad-hoc network has become an indivisible part for communication for mobile devices. There are different types of topology for implementation of Ad-hoc network. AODV is one of them which are a reactive protocol...

Healthcare monitoring system for web-enabled smart buildings  

This paper describes the implementation of a wireless healthcare device in a web technology based smart building. The device has multiple communication interfaces like Bluetooth and 6LoWPAN, and multiple monitoring...

Cloud Computing Data Storage and Security Enhancement  

Cloud computing is a model for enabling convenient, on-demand network access to a shared pool of configurable computing resources. Cloud computing provides computation, software, data access, and storage services t...

Download PDF file
  • EP ID EP109743
  • DOI -
  • Views 115
  • Downloads 0

How To Cite

Sanjaydeep Singh Lodhi, Premnarayan Arya, Dilip Vishwakarma (2012). Frequent Itemset Mining Technique in Data Mining  . International Journal of Advanced Research in Computer Engineering & Technology(IJARCET), 1(5), 395-404. https://europub.co.uk/articles/-A-109743