TEXT MINING USING KEYPHRASE EXTRACTION
Journal Title: Indian Journal of Computer Science and Engineering - Year 2010, Vol 1, Issue 2
Abstract
Text mining is powerful tool to find useful and needed information from huge data set. For context based text mining, keyphrases are used. Keyphrases provide brief summary about the contents of documents. In document clustering, number of total cluster is not known in advance. In K-means, if respecified number of clusters modified, the precision of each result is also modified. Therefore Kea ,is algorithm for automatically extracting keyphrases from text is used. In this kea algorithm, number of clusters is automatically determined by using extracted keyphrases. Keameans clustering algorithm provide easy and efficient way to extract test document from large quantity of resources. Keyphrase play important role in text indexing, summarization and categorization. Keyphrases are selected manually. Assigning keyphrases manually is tedious process that requires knowledge of subject. Therefore automatic extraction techniques are most useful.
Authors and Affiliations
Shobha S. Raskar , D. M. Thakore
A PKI ARCHITECTURE USING OPEN SOURCE SOFTWARE FOR EGOVERNMENT SERVICES IN ROMANIA
This article presents an architecture based on Open Source software that promote citizen’s access to electronic services in a secure way and attempt to make an analysis between two different Open Source Public Key Infras...
IDENTIFICATION OF MOST RELEVANT FEATURES FOR SENTIMENT ANALYSIS USING HETEROGENIC DOMAIN
The overwhelming majority of existing approaches to opinion feature extraction accept mining patterns solely from one review corpus, ignoring the nontrivial disparities in word spacing characteristics of opinion options...
Link Stability Based Hop By Hop Multicast Protocol For Vanets
Vanets are new emerging and challenging technology that makes an improvisation in traffic safety and efficiency. The constant growth of automobile industry is increasing the demand for car safety and the car to car conne...
SOFTWARE RELIABILITY OF PROFICIENT ENACTMENT
A software reliability exemplary projects snags the random process as disillusionments which were the culmination yield of two progressions: emerging faults and initial state values. The predominant classification uses t...
Searching SNT in XML Documents Using Reduction Factor
XML has become the most popular standard for data representation. In XML standard the documents represented as rooted ordered trees. The efficient query processing can be performed on the labeled document structure. The...