A Brief Survey On Document Clustering Techniques Using MATLAB
Journal Title: International Journal of Computer & organization Trends(IJCOT) - Year 2013, Vol 3, Issue 1
Abstract
Document clustering is a more specific technique for unsupervised document organization, it is generally considered to be a centralized process. Clustering methods can be used to automatically group the retrieved documents into a list of meaningful categories. This paper gives an overview of some of the mostly used document clustering techniques and introduces the matlab tool which provides us many functions that helps in the clustering of the documents. In particular we concentrate on the most commonly used clustering techniques Agglomerative hierarchical clustering and K-means that are commonly used for document clustering and related matlab functions available in the matlab toolbox.
Authors and Affiliations
Rachitha Sony. K rotha , Suneetha Merugula
A Knowledge discovery Approach in Shopping Complex Database (ASCD)
Data mining and Knowledge Discovery (KD) has been widely accepted as a key technology for enterprises to improve their abilities in data analysis, decision support and the automatic extraction of knowledge from dat...
Pattern Recognition for Finding Similarity of Webpages
We proposed a functional technique for identifying similar Web pages that is based on measuring tree similarity. In this paper we introduce an experiment with two methods for evaluating the similarity of web pages. The r...
Detection Of Brain Tumor Using Kernel Induced Possiblistic C-Means Clustering
Brain tumor is a major health problem throughout the world. Magnetic resonance imaging (MRI) scan can be used to produce image of any part of the body and it provides an efficient and fast way for diagnosis of the brain...
A New Approach For Image Cryptography Techniques
With the progress in data exchange by electronic system, the need of information security has become a necessity. Due to growth of multimedia applications, security becomes an important issue of communication and storage...
Impact Analysis of Modulation Formats in 40 Gbit/s DWDM Systems
The development of digital optical communications systems at high flow rates (NX40Gbit/s) and wavelength division multiplexed (WDM) reveals new issues such as the need to compensate the chromatic dispersion at higher lev...