A FREQUENT DOCUMENT MINING ALGORITHM WITH CLUSTERING

Journal Title: Indian Journal of Computer Science and Engineering - Year 2012, Vol 3, Issue 5

Abstract

Now days, finding the association rule from large number of item-set become very popular issue in the field of data mining. To determine the association rule researchers implemented a lot of algorithms and techniques. FPGrowth is a very fast algorithm for finding frequent item-set. This paper, give us a new idea in this field. It replaces the role of frequent item-set to frequent sub graph discovery. It uses the processing of datasets and describes modified FP-algorithm for sub-graph discovery. The document clustering is required for this work. It can use self-similarity function between pair of document graph that similarity can use for clustering with the help of affinity propagation and efficiency of algorithm can be measure by F-measure function.

Authors and Affiliations

Mr. Rakesh Kumar Soni , Prof. Neetesh Gupta , Prof. Amit Sinhal

Keywords

Related Articles

ON ROAD VEHICLE/OBJECT DETECTION AND TRACKING USING TEMPLATE

Vehicle tracking and detection plays an important role in traffic surveillance, still a crucial task in many applications. Till now, there is no standard method developed. Template matching is one of the methods used for...

ADAPTIVE PROGRESSIVE CODING FOR COMPRESSION OF BI-LEVEL VIDEO IMAGES

In video compression, progressive coding plays an important role in terms of coding efficiency and error resilience and has been an attractive research topic since the standardization of H.264/AVC. In this paper, we prop...

Decision Support in Heart Disease Prediction System using Naive Bayes

Data Mining refers to using a variety of techniques to identify suggest of information or decision making knowledge in the database and extracting these in a way that they can put to use in areas such as decision support...

PROCESS MODEL FOR REUSABILITY IN CONTEXT-SPECIFIC REUSABLE SOFTWARE COMPONENTS

Constructing component based software using reusable components is becoming a promising approach. Context-specific reuse is a broadly used way to increase the value of reuse. This paper reports our on-going work aimed at...

Performance Evolution of Various Wavelets in Cervical Lesion Detection

Cervical cancer is one of most common cancers among women in the world caused by human papilloma virus infection. It develops in the tissue of cervix which connects upper body of the uterus to the vagina. The types of ca...

Download PDF file
  • EP ID EP119837
  • DOI -
  • Views 123
  • Downloads 0

How To Cite

Mr. Rakesh Kumar Soni, Prof. Neetesh Gupta, Prof. Amit Sinhal (2012). A FREQUENT DOCUMENT MINING ALGORITHM WITH CLUSTERING. Indian Journal of Computer Science and Engineering, 3(5), 678-686. https://europub.co.uk/articles/-A-119837