Arabic Text Classification using Feature-Reduction Techniques for Detecting Violence on Social Media

Abstract

With the current increase in the number of online users, there has been a concomitant increase in the amount of data shared online. Techniques for discovering knowledge from these data can provide us with valuable information when it comes to detecting different problems, including violence. Violence is one of the significant problems humanity has faced in recent years all over the world, and this is especially a problem in Arabic countries. To address this issue, this research focuses on detecting violence-related tweets to help in solving this problem. Text mining is an important technique that can be used to find and predict information from text. In this study, a text classification model is built for detecting violence in Arabic dialects on Twitter using different feature-reduction approaches. The experiment comprises bagging, K-nearest neighbors (KNN), and Bayesian boosting using different extraction features, namely, root-based stemming, light stemming, and n-grams. In addition, the study used the following feature-reduction techniques: support vector machine (SVM), Chi-squared (CHI), the Gini index, correlation, rules, information gain (IG), deviation, symmetrical uncertainty, and the IG ratio. The experiment showed that the bagging with tri-gram approach has the highest accuracy at 86.61%, and a combination of IG with SVM from reduction features registers an accuracy of 90.59%.

Authors and Affiliations

Hissah ALSaif, Taghreed Alotaibi

Keywords

Related Articles

Extreme Learning Machine and Particle Swarm Optimization for Inflation Forecasting

Inflation is one indicator to measure the development of a nation. If inflation is not controlled, it will have a lot of negative impacts on people in a country. There are many ways to control inflation, one of them is f...

Developing Computer Network Based on EIGRP Performance Comparison and OSPF

One of the computer network systems technologies that are growing rapidly at this time is internet. In building the networks, a routing mechanism is needed to integrate the entire computer with a high degree of flexibili...

LBPH-based Enhanced Real-Time Face Recognition

Facial recognition has always gone through a consistent research area due to its non-modelling nature and its diverse applications. As a result, day-to-day activities are increasingly being carried out electronically rat...

Proposal for Scrambled Method based on NTRU

Scrambling is widely used to protect the security of data files such as text, image, video or audio files; however, it is not the most efficient method to protect the security of the data files. This article uses NTRU pu...

Context Aware Fuel Monitoring System for Cellular Sites

The past decade has been very productive for cellular operators of Pakistan, as their subscribers have grown exponentially with increase in revenue. After this wave of rising, the operators have now reached to saturation...

Download PDF file
  • EP ID EP550259
  • DOI 10.14569/IJACSA.2019.0100409
  • Views 101
  • Downloads 0

How To Cite

Hissah ALSaif, Taghreed Alotaibi (2019). Arabic Text Classification using Feature-Reduction Techniques for Detecting Violence on Social Media. International Journal of Advanced Computer Science & Applications, 10(4), 77-87. https://europub.co.uk/articles/-A-550259