Frequent Pattern Mining using CATSIM Tree
Journal Title: International Journal on Computer Science and Engineering - Year 2012, Vol 4, Issue 9
Abstract
Efficient algorithms to discover frequent patterns are essential in data mining research. Frequent pattern mining is emerging as powerful tool for many business applications such as e-commerce, recommender systems and supply chain management and group decision support systems to name a few. Several effective data structures, such as two-dimensional arrays, graphs, trees and tries have been proposed to collect candidate and frequent itemsets. It seems as the tree structure is most extractive to storing itemsets. The outstanding tree has been proposed so far is called FP-tree which is a prefix tree structure. Some advancement with the FP tree structure is proposed as CATS tree. CATS Tree extends the idea of FP-Tree to improve storage compression and allow frequent pattern mining without generation of candidate itemsets. It allows to mine only through a single pass over the database. The efficiency of Apriori, FP-Growth, CATS Tree for incremental mining is very poor. In all of the above mentioned algorithms, it is required to generate tree repeatedly to support incremental mining. The implemented CATSIM Tree uses more memory compared to Apriori, FP-Growth and CATS Tree, but with advancement in technology, is not a major concern. In this work CATSIM Tree with modifications in CATS Tree is implemented to support incremental mining with better results.
Authors and Affiliations
Ketan Modi , B. L. Pal
Colorectal Cancer MRI Image Segmentation Using Image Processing Techniques
Colorectal cancer is the third most commonly diagnosed cancer and the second leading cause of cancer death in men and women. Magnetic resonance imaging (MRI) established itself as the primary method for detection and sta...
S-boxes generated using Affine Transformation giving Maximum Avalanche Effect
The Advanced Encryption Standard (AES) was published by National Institute of Standards and Technology (NIST) in November 2001, to replace DES (Data Encryption Standard) and Triple DES. The S-box (Substitution box) used...
IP Address Blocking System
Hosting a site on the Internet makes it available everywhere. There are certain sites that are just meant for local use like local shopping marts that do not provide products for purchase in other countries. Also, there...
CLASSIFICATION OF LAND USE LAND COVER CHANGE DETECTION USING REMOTELY SENSED DATA
Image classification is perhaps the most important part of digital image analysis. With supervised classification, the information classes of interest like land cover type image. These are called “training sites”. The im...
A Novel Texture Synthesis Algorithm Using Patch Matching by Fuzzy Texture Unit
Texture is an important spatial feature useful for identifying objects or regions of interest in an image. This paper presents a novel texture characterization method based on Fuzzy Texture Unit (FTU) for texture synthes...