AN EFFICIENT TEXT CLASSIFICATION USING KNN AND NAIVE BAYESIAN
Journal Title: International Journal on Computer Science and Engineering - Year 2012, Vol 4, Issue 3
Abstract
The main objective is to propose a text classification method based on feature selection and preprocessing, thereby reducing the dimensionality of the feature vector and increasing classification accuracy. Text classification is the process of assigning a document to one or more target categories based on its contents. In the proposed method, text preprocessing methods are applied to different datasets, and feature vectors are then extracted for each new document using various feature weighting methods to enhance classification accuracy. The classifier is then trained with the Naive Bayesian (NB) and K-nearest neighbor (KNN) algorithms; for KNN, the prediction is made according to the category distribution among the k nearest neighbors. Experimental results show that the methods are favorable in terms of effectiveness and efficiency when compared with other classifiers such as SVM.
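The pipeline described in the abstract (preprocess, weight features, then classify with KNN or NB) can be sketched in a few lines of Python. This is a minimal illustration, not the authors' implementation: the abstract does not name the preprocessing steps or weighting schemes, so the stop-word list, the smoothed TF-IDF weighting, cosine similarity, Laplace smoothing, the toy corpus, and every function name below are assumptions chosen for the example.

```python
import math
from collections import Counter, defaultdict

# Assumed stop-word list; the abstract does not specify the preprocessing used.
STOPWORDS = frozenset({"the", "a", "is", "of", "and", "to", "in"})

def tokenize(text):
    """Lowercase, split on whitespace, drop stop words and non-alphabetic tokens."""
    return [w for w in text.lower().split() if w.isalpha() and w not in STOPWORDS]

def fit_idf(docs_tokens):
    """Smoothed inverse document frequency per term (one possible weighting)."""
    n = len(docs_tokens)
    df = Counter(t for toks in docs_tokens for t in set(toks))
    return {t: math.log((1 + n) / (1 + c)) + 1.0 for t, c in df.items()}

def vectorize(tokens, idf):
    """Sparse TF-IDF vector as a dict; terms unseen in training are dropped."""
    return {t: tf * idf[t] for t, tf in Counter(tokens).items() if t in idf}

def cosine(u, v):
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    nu = math.sqrt(sum(w * w for w in u.values()))
    nv = math.sqrt(sum(w * w for w in v.values()))
    return dot / (nu * nv) if nu and nv else 0.0

def knn_predict(train_vecs, labels, query_vec, k=3):
    """Majority vote over the categories of the k most similar training documents."""
    ranked = sorted(zip(train_vecs, labels),
                    key=lambda pair: cosine(pair[0], query_vec), reverse=True)
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]

def nb_train(docs_tokens, labels):
    """Collect class priors and per-class word counts for multinomial NB."""
    class_docs = Counter(labels)
    word_counts = defaultdict(Counter)
    for toks, y in zip(docs_tokens, labels):
        word_counts[y].update(toks)
    vocab = {t for toks in docs_tokens for t in toks}
    return class_docs, word_counts, vocab

def nb_predict(model, tokens):
    """Pick the class with the highest log posterior, using Laplace smoothing."""
    class_docs, word_counts, vocab = model
    total_docs = sum(class_docs.values())
    best_label, best_lp = None, -math.inf
    for y, n_y in class_docs.items():
        denom = sum(word_counts[y].values()) + len(vocab)
        lp = math.log(n_y / total_docs)
        for t in tokens:
            if t in vocab:  # ignore words never seen in training
                lp += math.log((word_counts[y][t] + 1) / denom)
        if lp > best_lp:
            best_label, best_lp = y, lp
    return best_label

# Toy corpus, invented for illustration only.
TRAIN = [
    ("the striker scored a goal in the match", "sport"),
    ("the team won the football match", "sport"),
    ("the goalkeeper saved a penalty", "sport"),
    ("the bank raised interest rates", "finance"),
    ("stock market shares fell", "finance"),
    ("investors bought bank shares", "finance"),
]
train_tokens = [tokenize(text) for text, _ in TRAIN]
train_labels = [label for _, label in TRAIN]
idf = fit_idf(train_tokens)
train_vecs = [vectorize(toks, idf) for toks in train_tokens]
nb_model = nb_train(train_tokens, train_labels)

def classify(text, k=3):
    """Return the (KNN, NB) predictions for a new document."""
    toks = tokenize(text)
    return knn_predict(train_vecs, train_labels, vectorize(toks, idf), k), nb_predict(nb_model, toks)
```

Calling `classify("football team scored goal")` returns a pair of labels, one from each classifier. Dropping stop words and unseen terms is one simple way to shrink the feature vector; a real system would add an explicit feature selection step on top of this.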
Authors and Affiliations
J. Sreemathy, P. S. Balamurugan
Prediction of the Query of Search Engine Using Back Propagation Algorithm
Information users depend on search engines; search engines therefore need a prediction system to predict the user's next query. Web mining techniques, such as neural networks, can be used for t...
Resilience Against Node Capture Attack using Asymmetric Matrices in Key Predistribution Scheme in Wireless Sensor Networks
Wireless Sensor Networks (WSNs) usually consist of a large number of tiny sensors with limited computation capability, memory space and power resources. WSNs are extremely vulnerable against any kind of internal or exter...
Secured Image Sharing and Deletion in the Cloud Storage Using Access Policies
Cloud computing is a general term for anything that involves delivering hosted services, Anything as a Service (AaaS), over the web on an on-demand basis. It uses the web and central remote servers to maintain data and application...
Fault Tolerance in Real Time Distributed System
In this paper we investigate the different techniques of fault tolerance which are used in many real time distributed systems. The main focus is on types of fault occurring in the system, fault detection techniques and t...
Trust Worthy Architecture Implementation for Mobile Ad hoc Networks
A mobile ad hoc network is a wireless communication network that does not rely on a fixed infrastructure and lacks any centralized control. The wireless and distributed nature of mobile ad hoc networks poses greater...