Classifying Arabic Text Using KNN Classifier
Journal Title: International Journal of Advanced Computer Science & Applications - Year 2016, Vol 7, Issue 6
Abstract
With the tremendous amount of electronic documents available, there is a great need to classify documents automatically. Classification is the task of assigning objects (images, text documents, etc.) to one of several predefined categories. The selection of important terms is vital to classifier performance, feature set reduction techniques such as stop word removal, stemming and term threshold were used in this paper. Three term-selection techniques are used on a corpus of 1000 documents that fall in five categories. A comparison study is performed to find the effect of using full-word, stem, and the root term indexing methods. K-nearest – neighbors classifiers used in this study. The averages of all folds for Recall, Precision, Fallout, and Error-Rate were calculated. The results of the experiments carried out on the dataset show the importance of using k-fold testing since it presents the variations of averages of recall, precision, fallout, and error rate for each category over the 10-fold.
Authors and Affiliations
Amer Al-Badarenah, Emad Al-Shawakfa, Khaleel Al-Rababah, Safwan Shatnawi, Basel Bani-Ismail
Distributed Group Key Management with Cluster based Communication for Dynamic Peer Groups
Secure group communication is an increasingly popular research area having received much attention in recent years. Group key management is a fundamental building block for secure group communication systems. This paper...
Student’s Opinions on Online Educational Games for Learning Programming Introductory
Use of educational games is an approach that has potential to change the existing educational method. This is due to games popularity among younger generation as well as engagement and fun features of games compared to c...
A Survey on using Neural Network based Algorithms for Hand Written Digit Recognition
The detection and recognition of handwritten content is the process of converting non-intelligent information such as images into machine edit-able text. This research domain has become an active research area due to vas...
Exploring Identifiers of Research Articles Related to Food and Disease using Artificial Intelligence
Currently hundreds of studies in the literature have shown the link between food and reducing the risk of chronic diseases. This study investigates the use of natural language processing and artificial intelligence techn...
On Prospects of Development of Telecommunication Systems and Services based on Virtual Reality Technology
Virtual reality technologies are considered to be a basis and a promising development trend of telecommunication systems’ and services. New opportunities and sci-tech problems that need to be solved are currently undergo...