Filter-Wrapper Approach to Feature Selection Using PSO-GA for Arabic Document Classification with Naive Bayes Multinomial

Journal Title: IOSR Journals (IOSR Journal of Computer Engineering) - Year 2015, Vol 17, Issue 6

Abstract

Abstract: Text categorization and feature selection are two of the many text data mining problems. In text categorization, the document that contains a collection of text will be changed to the dataset format, the dataset that consists of features and class, words become features and categories ofdocuments become class on this dataset. The number of features that too many can cause a decrease in performance of classifier because many of the features that are redundant and not optimal so that feature selection is required to select the optimal features. This paper proposed a feature selectionstrategy based on Particle Swarm Optimization (PSO) and Genetic Algorithm (GA) methods for Arabic Document Classification with Naive Bayes Multinomial (NBM). Particle Swarm Optimization (PSO) is adopted in the first phase with the aim to eliminate the insignificant features and prepared the reduce features to the next phase. In the second phase, the reduced features are optimized using the new evolutionary computation method, Genetic Algorithm (GA). These methods have greatly reduced the features and achieved higher classification compared with full features without features selection. From the experiment that has been done the obtained results of accuracy are NBM85.31%, NBM-PSO 83.91% and NBM-PSO-GA 90.20%.

Authors and Affiliations

Indriyani , Wawan Gunawan , Ardhon Rakhmadi

Keywords

Related Articles

A Survey on Routing Protocols influencing Mobile Sink in enhancing life time of WSN for Data gathering

Abstract: Wireless Sensor Network is a group of specialized sensors have the ability to sense, monitor, communicate to the neighbors and recording conditions at assorted locations such as temperature, sound, pressure, se...

 A Survey on Balancing the Network Load Using GeographicHash Tables

 Abstract: The load Balancing in the network is a severe problem in network. The data created in wirelessnetwork is kept on node. It accessed over geographic hash table. The geographic hash table is used to recoverd...

Dynamic Advertising: A Big Data Analytics Approach

Due to development of malls there is a severe impact on local shops, leading to decline in sales of groceries, fruits and vegetables, processed foods, garments, shoes, electronic and electrical goods. All such local shop...

 Green Computing Under Cloud Environment Proposed architecture using cloud computing & thin client

 Private Cloud computing provides attractive & cost efficient Server Based Computing (SBC). The implementation of Thin client computing for private cloud computing will reduce the IT Cost and consumes less pow...

Hand Gesture Recognition System for Creating & Controlling Media Player using Mat Lab Tool

Abstract: In this paper we have discussed a dynamic hand gesture recognition system for creating and controlling the media player. The system is made possible by using the concepts of threshold and color detection. The...

Download PDF file
  • EP ID EP122852
  • DOI -
  • Views 120
  • Downloads 0

How To Cite

Indriyani, Wawan Gunawan, Ardhon Rakhmadi (2015). Filter-Wrapper Approach to Feature Selection Using PSO-GA for Arabic Document Classification with Naive Bayes Multinomial. IOSR Journals (IOSR Journal of Computer Engineering), 17(6), 45-51. https://europub.co.uk/articles/-A-122852