Mining Frequent Patterns with Optimized Candidate Representation on GPU using Parallel Eclat Algorithm
Journal Title: IOSR Journals (IOSR Journal of Computer Engineering) - Year 2019, Vol 21, Issue 1
Abstract
Frequent itemset mining is one of the important aspects of association rule mining. The primary algorithm based for frequent itemset mining is mostly based on CPU and they generated a large set of items that are required to be kept in memory all the time while processing. In this dissertation thesis, we designed a parallel Eclat algorithm. The algorithm will run on GPU and perform the task of the frequentitemset mining in parallel. The proposed algorithm also uses the optimized candidate representation and the frequent item sets generated are stored in cache memory and are fetched directly from the cache memory. The proposed algorithm runs in parallel and also uses the optimized candidate representation and thus provides better performance than the classical eclat algorithm. Thus, the proposed algorithm runs much faster than the classical eclat algorithm and has better performance than classical eclat algorithm in terms of memory and time.
Authors and Affiliations
Janki Kansagra, Dr. Nilesh Kalani, Dr. Paresh Tanna
A Quantified Approach for large Dataset Compression in Association Mining
With the rapid development of computer and information technology in the last several decades, an enormous amount of data in science and engineering will continuously be generated in massive scale; data com...
Machine Learning techniques for filtering of unwanted messages in Online Social Networks
As of Recent Years, Online Social Networks have transformed into a key bit of step by step life. One key issue in today user wall(s) is to give users the ability to control the messages posted in solitude private s...
Detecting Vulnerable User in Twitter Using Tweet Description Logic Rule Generation
In today’s modern era, Social media have become a mandatory, exciting and mundane apparatus in our lives and diverse informal organizations have distinctive focused set of people. Among these networks, twitter standout i...
A Novel Wrapper-filter Hybrid Method for Candidate SNPs Selection
Abstract: Genomic studies provide massive amount of data including thousands of Single Nucleotide Polymorphisms (SNPs). The analysis of SNPs helps to identify genetic variants related to complex traits. Therefore, it is...
Analysis of C4.5 and K-Nearest Neighbor (KNN) Method on Algorithm of Clustering For Deciding Mainstay Area
Development as a sustainable activity needs a good plan, so the programs can be effective and have a clear objective. Therefore, a model to help the analysis is significantly needed in determining the priority area...