HIERARCHICAL DOCUMENT ORGANIZATION AND RETRIEVAL BASED ON THEMES FOR NEWS TRACKS
Journal Title: International Journal on Computer Science and Engineering - Year 2013, Vol 5, Issue 12
Abstract
Organizing text documents is an important task and there are also numbers of strategies available in it. A good document clustering approach can assist computers in organizing the document corpus automatically into a meaningful cluster hierarchy for efficient browsing and navigation, which is very valuable for overcoming the deficiencies of traditional information retrieval methods. By clustering the text documents, the documents sharing the same topic are grouped together. Unlike document classification, no labelled documents are provided in clustering. Hence clustering is also known as unsupervised learning. In case of term based data retrieval, time consumption problem prevails. This is because as for each term, the data set’s has to be retrieved. Hence we are going for taxonomy based data retrieval. This paper presents the taxonomical approach of clustering data set in a dynamic environment. It is a difficult task to cluster data in a dynamic environment. But this can be made easily by using RSS feeds.
Authors and Affiliations
S. M. Arnica Sowmi , D. Dinesh Babu
A Framework for Analyzing Software Quality using Hierarchical Clustering
Fault proneness data available in the early software life cycle from previous releases or similar kind of projects will aid in improving software quality estimations. Various techniques have been proposed in the literatu...
Building Personalized and Non Personalized Recommendation Systems
The contents of e-Commerce such as music, movies, books and electronics goods are necessary for a modern life style. But, it becomes difficult to find content according to users likes and users preference. An approach wh...
Resource-Aware Load Balancing Scheme using Multi-objective Optimization in Cloud Computing
Cloud computing is a service based, on-demand, pay per use model consisting of an interconnected and virtualizes resources delivered over internet. In cloud computing, usually there are number of jobs that need to be exe...
Effective Term Based Text Clustering Algorithms
Text clustering methods can be used to group large sets of text documents. Most of the text clustering methods do not address the problems of text clustering such as very high dimensionality of the data and understandabi...
Noise Control in Industries by Adaptive MDCT Method
Industrial noise induced hearing loss is an increasingly prevalent disorder that is the result of exposure to high intensity sounds, especially over a long period of time. .Noises of industry can cause partial deafness,...