AN EFFICIENT TEXT CLASSIFICATION USING KNN AND NAIVE BAYESIAN
Journal Title: International Journal on Computer Science and Engineering - Year 2012, Vol 4, Issue 3
Abstract
The main objective is to propose a text classification method based on feature selection and preprocessing, thereby reducing the dimensionality of the feature vector and increasing classification accuracy. Text classification is the process of assigning a document to one or more target categories based on its contents. In the proposed method, text preprocessing is applied to different datasets, and feature vectors are then extracted for each new document using various feature weighting methods to enhance classification accuracy. The classifier is then trained with the Naive Bayesian (NB) and K-nearest neighbor (KNN) algorithms; for KNN, the prediction is made according to the category distribution among the k nearest neighbors. Experimental results show that the methods are favorable in terms of effectiveness and efficiency when compared with other classifiers such as SVM.
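The pipeline described in the abstract (preprocess, weight features, then classify with KNN or NB) can be illustrated with a minimal sketch. The abstract does not specify the preprocessing steps, the weighting scheme, the similarity measure, or the datasets, so everything concrete here is an assumption: a tiny stopword list, smoothed TF-IDF weighting, cosine similarity for KNN, multinomial Naive Bayes with add-one smoothing, and a six-document toy corpus. All function names are hypothetical and not taken from the paper.

```python
import math
from collections import Counter, defaultdict

# Assumption: a tiny illustrative stopword list, not the paper's.
STOPWORDS = {"the", "a", "an", "is", "of", "and", "to", "in"}

def preprocess(text):
    """Lowercase, split on whitespace, keep alphabetic non-stopword tokens."""
    return [t for t in text.lower().split() if t.isalpha() and t not in STOPWORDS]

def fit_idf(token_docs):
    """Smoothed inverse document frequency computed from the training documents."""
    n = len(token_docs)
    df = Counter(term for doc in token_docs for term in set(doc))
    return {term: math.log((1 + n) / (1 + count)) + 1.0 for term, count in df.items()}

def tfidf(tokens, idf):
    """L2-normalised sparse TF-IDF vector; terms unseen in training are dropped."""
    tf = Counter(t for t in tokens if t in idf)
    vec = {t: c * idf[t] for t, c in tf.items()}
    norm = math.sqrt(sum(w * w for w in vec.values()))
    return {t: w / norm for t, w in vec.items()} if norm else {}

def cosine(u, v):
    """Cosine similarity of two L2-normalised sparse vectors (a dot product)."""
    return sum(w * v.get(t, 0.0) for t, w in u.items())

def knn_predict(query_vec, train_vecs, labels, k=3):
    """Majority vote over the categories of the k most similar training documents."""
    ranked = sorted(zip(train_vecs, labels),
                    key=lambda pair: cosine(query_vec, pair[0]), reverse=True)
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]

def train_nb(token_docs, labels):
    """Collect per-class document counts and term counts for multinomial NB."""
    class_docs = Counter(labels)
    term_counts = defaultdict(Counter)
    for tokens, label in zip(token_docs, labels):
        term_counts[label].update(tokens)
    vocab = {t for counts in term_counts.values() for t in counts}
    return class_docs, term_counts, vocab

def nb_predict(tokens, model):
    """Pick the class with the highest log-posterior under add-one smoothing."""
    class_docs, term_counts, vocab = model
    total_docs = sum(class_docs.values())
    best_label, best_score = None, -math.inf
    for label, n_docs in class_docs.items():
        total_terms = sum(term_counts[label].values())
        score = math.log(n_docs / total_docs)
        for t in tokens:
            if t in vocab:  # ignore terms never seen in training
                score += math.log((term_counts[label][t] + 1) / (total_terms + len(vocab)))
        if score > best_score:
            best_label, best_score = label, score
    return best_label

# Toy corpus, invented for illustration only.
corpus = [
    ("the team won the football match", "sport"),
    ("a striker scored in the football final", "sport"),
    ("the coach praised the team", "sport"),
    ("the bank raised interest rates", "finance"),
    ("stock markets fell as rates rose", "finance"),
    ("investors sold bank shares", "finance"),
]
train_tokens = [preprocess(text) for text, _ in corpus]
train_labels = [label for _, label in corpus]
idf = fit_idf(train_tokens)
train_vecs = [tfidf(tokens, idf) for tokens in train_tokens]
nb_model = train_nb(train_tokens, train_labels)

query = preprocess("football team scored")
print(knn_predict(tfidf(query, idf), train_vecs, train_labels, k=3))
print(nb_predict(query, nb_model))
```

In this sketch the KNN step votes over the categories of the k most similar training documents, which corresponds to predicting from the category distribution among the k nearest neighbors. Dropping stopwords and unseen terms is one simple way to shrink the feature vector; the paper's own feature selection may differ.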
Authors and Affiliations
J. Sreemathy, P. S. Balamurugan