A FREQUENT DOCUMENT MINING ALGORITHM WITH CLUSTERING - Europub

Search

Apply

A FREQUENT DOCUMENT MINING ALGORITHM WITH CLUSTERING

Journal Title: Indian Journal of Computer Science and Engineering - Year 2012, Vol 3, Issue 5

Abstract

Now days, finding the association rule from large number of item-set become very popular issue in the field of data mining. To determine the association rule researchers implemented a lot of algorithms and techniques. FPGrowth is a very fast algorithm for finding frequent item-set. This paper, give us a new idea in this field. It replaces the role of frequent item-set to frequent sub graph discovery. It uses the processing of datasets and describes modified FP-algorithm for sub-graph discovery. The document clustering is required for this work. It can use self-similarity function between pair of document graph that similarity can use for clustering with the help of affinity propagation and efficiency of algorithm can be measure by F-measure function.

Authors and Affiliations

Mr. Rakesh Kumar Soni , Prof. Neetesh Gupta , Prof. Amit Sinhal

Keywords

Clustering document-graph FP-Growth graph mining frequent sub graphs clustering.

Related Articles

JOINT CHANNEL ESTIMATION AND DECODING OF RAPTOR CODE ON FADING CHANNEL

In this paper, the problem of transmission of Raptor codes over fading channel is considered. We present in this paper joint decoder architecture for Raptor codes over phase coherent fading channel. The proposed scheme d...

ENRICHMENT OF SECURITY THROUGH CRYPTOGRAPHIC PUBLIC KEY ALGORITHM BASED ON BLOCK CIPHER

In recent years Data Security using cryptography has emerged as a topic of significant interest in both academic and industry circles. This paper deals with a new algorithm, which is based on linear block cipher. Our goa...

COMPARATIVE STUDY OF DIFFERENT APPROACHES FOR EFFICIENT RECTIFICATION UNDER GENERAL MOTION

This paper is concerned with the generating an efficient fundamental matrix for image rectification problem under motion. Computation of fundamental matrix is the information source to estimate the accurate optic flow in...

IMPROVING THE SOFTWARE ARCHITECTURE THROUGH FUZZY CLUSTERING TECHNIQUE.

Software Architecture Recovery is one of the finest parts of reverse engineering. Several different techniques have been adopted in vast literature to recover Software Architecture. One of the techniques is clustering, w...

DETECTION AND CLASSIFICATION OF TUMORS IN CT IMAGES

Image segmentation is the process of partitioning a digital image into multiple segments or set of pixels. The objective of image segmentation is to group pixels into a prominent image region. In this paper, segmentation...

Download PDF file

EP ID EP119837
DOI -
Views 143
Downloads 0