Filter-Wrapper Approach to Feature Selection Using PSO-GA for Arabic Document Classification with Naive Bayes Multinomial
Journal Title: IOSR Journals (IOSR Journal of Computer Engineering) - Year 2015, Vol 17, Issue 6
Abstract
Text categorization and feature selection are two of the many problems in text data mining. In text categorization, a collection of text documents is converted into a dataset of features and classes: words become the features, and document categories become the classes. Too many features can degrade classifier performance, because many of them are redundant or uninformative, so feature selection is required to pick the optimal subset. This paper proposes a feature selection strategy based on Particle Swarm Optimization (PSO) and Genetic Algorithm (GA) for Arabic document classification with Naive Bayes Multinomial (NBM). PSO is applied in the first phase to eliminate insignificant features and pass the reduced feature set to the next phase. In the second phase, the reduced features are optimized using an evolutionary computation method, the Genetic Algorithm (GA). Together, these methods greatly reduce the number of features and achieve higher classification accuracy than the full feature set without feature selection. In the experiments, the accuracies obtained are 85.31% for NBM, 83.91% for NBM-PSO, and 90.20% for NBM-PSO-GA.
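The two-phase idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the toy English corpus, the hyperparameters (inertia 0.7, acceleration 1.5, population sizes, mutation rate), and the function names `nb_accuracy`, `pso_select`, `ga_select`, and `pso_ga_select` are all assumptions made for illustration. Both phases here score a feature mask by the leave-one-out accuracy of a small multinomial Naive Bayes classifier, which may differ from the criterion used in the paper.

```python
import math
import random
from collections import Counter

# Hypothetical toy corpus (not from the paper): (tokens, class label).
DOCS = [
    ("goal match team win".split(), "sport"),
    ("team player goal score".split(), "sport"),
    ("match player win cup".split(), "sport"),
    ("market bank price rise".split(), "economy"),
    ("bank loan price market".split(), "economy"),
    ("price trade market loan".split(), "economy"),
]
VOCAB = sorted({w for tokens, _ in DOCS for w in tokens})  # words become features

def nb_accuracy(mask, docs=DOCS, vocab=VOCAB):
    """Leave-one-out accuracy of multinomial NB using only features where mask is 1."""
    keep = {w for w, bit in zip(vocab, mask) if bit}
    if not keep:
        return 0.0
    correct = 0
    for i, (tokens, label) in enumerate(docs):
        train = docs[:i] + docs[i + 1:]
        classes = sorted({c for _, c in train})
        scores = {}
        for c in classes:
            counts = Counter(w for t, cc in train if cc == c for w in t if w in keep)
            total = sum(counts.values())
            prior = sum(1 for _, cc in train if cc == c) / len(train)
            # Laplace-smoothed log-likelihood over the kept words of the held-out doc.
            scores[c] = math.log(prior) + sum(
                math.log((counts[w] + 1) / (total + len(keep)))
                for w in tokens if w in keep)
        correct += max(classes, key=lambda c: scores[c]) == label
    return correct / len(docs)

def pso_select(fitness, n, rng, particles=8, iters=15):
    """Phase 1: binary PSO over feature masks; returns the best mask found."""
    pos = [[rng.randint(0, 1) for _ in range(n)] for _ in range(particles)]
    vel = [[0.0] * n for _ in range(particles)]
    pbest = [p[:] for p in pos]
    gbest = max(pbest, key=fitness)[:]
    for _ in range(iters):
        for i in range(particles):
            for d in range(n):
                vel[i][d] = (0.7 * vel[i][d]
                             + 1.5 * rng.random() * (pbest[i][d] - pos[i][d])
                             + 1.5 * rng.random() * (gbest[d] - pos[i][d]))
                # Sigmoid of the velocity is the probability that the bit is 1.
                pos[i][d] = int(rng.random() < 1 / (1 + math.exp(-vel[i][d])))
            if fitness(pos[i]) > fitness(pbest[i]):
                pbest[i] = pos[i][:]
        gbest = max(pbest + [gbest], key=fitness)[:]
    return gbest

def ga_select(fitness, n, rng, pop_size=10, gens=15, p_mut=0.1):
    """Phase 2: GA with tournament selection, one-point crossover, bit-flip mutation."""
    # Seed the population with the all-ones mask (keep everything phase 1 kept).
    pop = [[1] * n] + [[rng.randint(0, 1) for _ in range(n)] for _ in range(pop_size - 1)]
    for _ in range(gens):
        children = [max(pop, key=fitness)[:]]  # elitism: carry the best forward
        while len(children) < pop_size:
            a = max(rng.sample(pop, 2), key=fitness)
            b = max(rng.sample(pop, 2), key=fitness)
            cut = rng.randrange(1, n) if n > 1 else 0
            child = a[:cut] + b[cut:]
            children.append([bit ^ (rng.random() < p_mut) for bit in child])
        pop = children
    return max(pop, key=fitness)

def pso_ga_select(seed=0):
    """Run PSO on all features, then GA on the features PSO kept."""
    rng = random.Random(seed)
    pso_mask = pso_select(nb_accuracy, len(VOCAB), rng)
    kept = [i for i, bit in enumerate(pso_mask) if bit]

    def expand(sub):
        full = [0] * len(VOCAB)
        for i, bit in zip(kept, sub):
            full[i] = bit
        return full

    ga_sub = ga_select(lambda sub: nb_accuracy(expand(sub)), len(kept), rng)
    return pso_mask, expand(ga_sub)
```

Calling `pso_ga_select()` returns the phase-1 mask and the final mask over `VOCAB`. Because the GA population is seeded with the full PSO selection and keeps its best individual each generation, the final mask in this sketch never scores below the PSO mask on the toy fitness; that property belongs to the sketch and says nothing about the reported accuracies.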
Authors and Affiliations
Indriyani, Wawan Gunawan, Ardhon Rakhmadi