AN EFFICIENT TEXT CLASSIFICATION USING KNN AND NAIVE BAYESIAN
Journal Title: International Journal on Computer Science and Engineering - Year 2012, Vol 4, Issue 3
Abstract
The main objective is to propose a text classification method based on feature selection and preprocessing, thereby reducing the dimensionality of the feature vector and increasing classification accuracy. Text classification is the process of assigning a document to one or more target categories based on its contents. In the proposed method, text preprocessing methods are applied to different datasets, and feature vectors are then extracted for each new document using various feature weighting methods to enhance classification accuracy. The classifier is then trained with the Naive Bayesian (NB) and K-nearest neighbor (KNN) algorithms; for KNN, the prediction is made according to the category distribution among the k nearest neighbors. Experimental results show that the methods are favorable in terms of effectiveness and efficiency when compared with other classifiers such as SVM.
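The pipeline described in the abstract (preprocessing, feature weighting, then KNN and NB classification) can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation: the stop-word list, the smoothed TF-IDF weighting, cosine similarity as the KNN distance, Laplace smoothing in NB, the toy documents, and all function names are assumptions chosen for illustration, since the abstract does not specify them.

```python
import math
import re
from collections import Counter, defaultdict

# Tiny stop-word list for illustration only (an assumption, not from the paper).
STOP_WORDS = {"the", "a", "an", "is", "of", "and", "to", "in", "on", "for"}

def preprocess(text):
    """Lowercase, keep alphabetic tokens, and drop stop words."""
    return [t for t in re.findall(r"[a-z]+", text.lower()) if t not in STOP_WORDS]

def weight(tokens, idf):
    """TF-IDF weights for one document; terms unseen in training are ignored."""
    return {t: c * idf[t] for t, c in Counter(tokens).items() if t in idf}

def tfidf_vectors(token_lists):
    """Return one sparse TF-IDF dict per document plus the IDF table."""
    n = len(token_lists)
    df = Counter(term for tokens in token_lists for term in set(tokens))
    # Smoothed IDF, one common variant (assumed here).
    idf = {term: math.log((1 + n) / (1 + count)) + 1 for term, count in df.items()}
    return [weight(tokens, idf) for tokens in token_lists], idf

def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm = math.sqrt(sum(w * w for w in u.values())) * math.sqrt(sum(w * w for w in v.values()))
    return dot / norm if norm else 0.0

def knn_predict(query, vectors, labels, k=3):
    """Majority vote over the k training documents most similar to the query."""
    ranked = sorted(zip(vectors, labels), key=lambda p: cosine(query, p[0]), reverse=True)
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]

def nb_train(token_lists, labels):
    """Collect the counts needed by a multinomial Naive Bayes classifier."""
    class_docs = Counter(labels)
    term_counts = defaultdict(Counter)
    for tokens, label in zip(token_lists, labels):
        term_counts[label].update(tokens)
    vocab = {t for tokens in token_lists for t in tokens}
    return class_docs, term_counts, vocab

def nb_predict(tokens, model):
    """Pick the class with the highest log posterior (Laplace smoothing)."""
    class_docs, term_counts, vocab = model
    total_docs = sum(class_docs.values())
    best_label, best_score = None, -math.inf
    for label, n_docs in class_docs.items():
        total_terms = sum(term_counts[label].values())
        score = math.log(n_docs / total_docs)
        for t in tokens:
            if t in vocab:  # skip terms never seen during training
                score += math.log((term_counts[label][t] + 1) / (total_terms + len(vocab)))
        if score > best_score:
            best_label, best_score = label, score
    return best_label

# Toy training set (hypothetical data, for illustration only).
train_texts = [
    "the team won the football match",
    "a great goal in the football game",
    "the player scored in the match",
    "the election results and the new government",
    "parliament passed the budget vote",
    "the minister announced a new policy vote",
]
train_labels = ["sport", "sport", "sport", "politics", "politics", "politics"]

train_tokens = [preprocess(t) for t in train_texts]
train_vectors, idf = tfidf_vectors(train_tokens)
nb_model = nb_train(train_tokens, train_labels)

query_tokens = preprocess("football player scored a goal")
print(knn_predict(weight(query_tokens, idf), train_vectors, train_labels, k=3))
print(nb_predict(query_tokens, nb_model))
```

In this sketch KNN votes over TF-IDF vectors while NB works on raw term counts, which is one conventional pairing; the paper may combine preprocessing and weighting differently.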
Authors and Affiliations
J. Sreemathy, P. S. Balamurugan