Improving Efficiency of Apriori Algorithms for Sequential Pattern Mining 

Journal Title: Bonfring International Journal of Data Mining - Year 2014, Vol 4, Issue 1

Abstract

Computer Systems are exposed to an increasing number of different types of security threats due to the expanding of internet in recent years. How to detect network intrusions effectively becomes an important security technique. Many intrusions arenot composed by single events, but by a series of attack steps taken in chronological order. Analyzing the order in which events occur can improve the attack detection accuracy and reduce false alarms. Intrusion is a multi step process in which a number of events must occur sequentially in order to launch a successful attack. Intrusion detection using sequential pattern mining is a research topic focusing on the field of information security. Sequential Pattern Mining is used to discover the frequent sequential pattern in the event dataset. Sequential Pattern mining algorithms can be broadly classified into Apriori based, Pattern growth based and a combination of both. The first algorithm is based on the characteristic of Apriori and the second uses a pattern growth approach. The major drawback of the Apriori based algorithm is the multiple scans of the database, generating maximal patterns. In this paper, a simulation study of both the algorithms, a modified AprioriALL Algorithm to optimize the processing by including set theory techniques and the original AprioriALL algorithm is done on a network intrusion dataset from KDD cup 1999. Experimental results show that the modified algorithm shrinks the dataset size. At the most, it also scans the database twice. Also, as the interestingness of the itemset is increased with the dataset shrinking it leads to efficient sequences with high associativity. As the database is reduced, the time taken to mine sequences also reduces and is faster than Apriori based algorithm. 

Authors and Affiliations

Alpa Reshamwala, Dr. Sunita Mahajan

Keywords

Related Articles

Literature Review on Web Mining 

Web is a platforms for information exchange, as it is simple and easy to publish documents. Searching for information becomes a difficult and time-consuming process as the web grows. Web mining uses various data mining t...

Conditional Variables Double Sampling Plan for Weibull Distributed Lifetimes under Sudden Death Testing

n this paper, we propose a conditional sampling plan called conditional double sampling plan for lot acceptance of parts whose life time follows a Weibull distribution with known shape parameter under sudden death testin...

An Efficient Meta Scheduling based Virtual Consolidation for Resource Sharing in Green Cloud

In modern researchers, cloud parallel data processing has emerging resource that to be one of the problematic application for Infrastructure-as-a-Service (IaaS) clouds. Major Cloud processing companies include starting i...

Consensus Clustering for Microarray Gene Expression Data 

Cluster analysis in microarray gene expression studies is used to find groups of correlated and co-regulated genes. Several clustering algorithms are available in the literature. However no single algorithm is optimal fo...

An Analytical Study on Early Diagnosis and Classification of Diabetes Mellitus 

Diabetes mellitus (DM) is a chronic, general, life-threatening syndrome occurring all around the world. It is characterized by hyperglycemia occurring due to abnormalities in insulin secretion which would in turn result...

Download PDF file
  • EP ID EP152973
  • DOI 10.9756/BIJDM.4774
  • Views 102
  • Downloads 0

How To Cite

Alpa Reshamwala, Dr. Sunita Mahajan (2014). Improving Efficiency of Apriori Algorithms for Sequential Pattern Mining . Bonfring International Journal of Data Mining, 4(1), 1-6. https://europub.co.uk/articles/-A-152973