A COMPARATIVE STUDY OF FUZZY MODELS IN DOCUMENT CLUSTERING

Journal Title: International Journal on Computer Science and Engineering - Year 2012, Vol 4, Issue 1

Abstract

The availability of large quantity of text documents from the World Wide Web and business document management systems has made the dynamic separation of texts into new categories as a very important task for every business intelligence systems. Text document clustering is one of the emerging and most needed clustering techniques used to cluster documents with regard to similarity among documents. It is used widely in digital library management system in the modern context. Document clustering is widely applicable in areas such as search engines, web mining, information retrieval, and topological analysis. There are several clustering approaches available in the literature to cluster the document. But most of the existing clustering techniques suffer from a wide range of limitations. The existing clustering approaches face the issues like practical applicability, very less accuracy, more classification time etc. Thus a novel approach is needed for providing significant accuracy with less classification time. In recent times, inclusion of fuzzy logic in clustering provides better clustering results. One of the widely used fuzzy logic based clustering is Fuzzy C-Means (FCM) Clustering. In order to further improve the performance of clustering, this thesis uses Modified Fuzzy C-Means (MFCM) Clustering. The documents are ranked using Term Frequency–Inverse Document Frequency (TF–IDF) technique. From the experimental results, it can be observed that the proposed technique results in better clustering when compared to the FCM clustering technique.

Authors and Affiliations

G. MANIMEKALAI , K. SATHIYAKUMARI , V. PREAMSUDHA

Keywords

Related Articles

A Study on Enhancement of Loadability of Large-Scale Emerging Power Systems by Using FACTS Controllers

This study presents comprehensive review of various ethods/techniques for incorporation of differential algebraic equations (DAE) model of FACTS controllers and different type of loads such as a static, dynamic, and com...

A Novel Benchmark K-Means Clustering on Continuous Data

Cluster analysis is one of the prominent techniques in the field of data mining and k-means is one of the most well known popular and partitioned based clustering algorithms. K-means clustering algorithm is widely used i...

An Algorithmic Approach for Efficient Image Compression using Neuro-Wavelet Model and Fuzzy Vector Quantization Technique

Applications, which need to store large database and/or transmit digital images requiring high bit-rates over channels with limited bandwidth, have demanded improved image compression techniques. This paper describes pra...

Design of a new search engine for the information search using image as an input key

It’s a long time we didn’t actually think about this. Whenever we need an image or the information regarding an image to be searched, we have to give a hint or the keyword related to the image. But think of a situation w...

Robust TCP: An improvement on TCP protocol

The Transmission Control Protocol (TCP) is the most popular transport layer protocol for the internet. Congestion Control is used to increase the congestion window size if there is additional bandwidth on the network, an...

Download PDF file
  • EP ID EP113979
  • DOI -
  • Views 110
  • Downloads 0

How To Cite

G. MANIMEKALAI, K. SATHIYAKUMARI, V. PREAMSUDHA (2012). A COMPARATIVE STUDY OF FUZZY MODELS IN DOCUMENT CLUSTERING. International Journal on Computer Science and Engineering, 4(1), 114-124. https://europub.co.uk/articles/-A-113979