Auto-assemblage for Suffix Tree Clustering
Journal Title: International Journal of Advanced Research in Computer Engineering & Technology(IJARCET) - Year 2012, Vol 1, Issue 4
Abstract
Due to explosive growth of extracting the information from large repository of data, to get effective results, clustering is used. Clustering makes the searching efficient for better search results. Clustering is the process of grouping of similar type content. Document Clustering; organize the documents of similar type contents into groups. Partitioned and Hierarchical clustering algorithms are mainly used for clustering the documents. In this paper, k-means describe the partitioned clustering algorithm and further hierarchical clustering defines the Agglomerative hierarchical clustering and Divisive hierarchical clustering. The paper presents the tool, which describe the algorithmic steps that are used in Suffix Tree Clustering (STC) algorithm for clustering the documents. STC is a search result clustering, which perform the clustering on the dataset. Dataset is the collection of the text documents. The paper focuses on the steps for document clustering by using the Suffix Tree Clustering Algorithm. The algorithm steps are display by the screen shots that is taken from the running tool.
Authors and Affiliations
Pushplata , Mr. Ram Chatterjee
A Tailored Ontology Sculpt For Web Information Congregation
As a sculpt for acquaintance explanation and exemplification, ontologies are extensively used to symbolize consumer profiles in tailored web information congregation. Conversely, when representing consumer profiles...
Optical Character Recognition Techniques: A survey
Assessing Quality of Software Service at Selection Time Using Evolutionary Algorithm
The integration of external software in project development can be challenging and risky, because as the execution quality of the external software and the trustworthiness of the software is unknown during integrat...
Improved Performance of Multiuser System Using Combined Diversity with Nagakami Fading Channel
In wireless communication security, Signal to Noise Ratio (SNR) and Bit Error Rate (BER) are the important parameters to be improved. Frequency Hopping Spread Spectrum (FHSS) combined with Nagakami Fading Channel...
Performance Enhancement of Data Communication through Visible Light Communication Using On Off Keying
Visible Light Communication (VLC) refers to short-range optical wireless communication using visible light spectrum from 380 to 780nm and it has many advantages such as it can offer speeds up to 10GB/S. The other a...