Arabic Text Classification using Feature-Reduction Techniques for Detecting Violence on Social Media

Abstract

With the current increase in the number of online users, there has been a concomitant increase in the amount of data shared online. Techniques for discovering knowledge from these data can provide us with valuable information when it comes to detecting different problems, including violence. Violence is one of the significant problems humanity has faced in recent years all over the world, and this is especially a problem in Arabic countries. To address this issue, this research focuses on detecting violence-related tweets to help in solving this problem. Text mining is an important technique that can be used to find and predict information from text. In this study, a text classification model is built for detecting violence in Arabic dialects on Twitter using different feature-reduction approaches. The experiment comprises bagging, K-nearest neighbors (KNN), and Bayesian boosting using different extraction features, namely, root-based stemming, light stemming, and n-grams. In addition, the study used the following feature-reduction techniques: support vector machine (SVM), Chi-squared (CHI), the Gini index, correlation, rules, information gain (IG), deviation, symmetrical uncertainty, and the IG ratio. The experiment showed that the bagging with tri-gram approach has the highest accuracy at 86.61%, and a combination of IG with SVM from reduction features registers an accuracy of 90.59%.

Authors and Affiliations

Hissah ALSaif, Taghreed Alotaibi

Keywords

Related Articles

Programming Technologies for the Development of Web-Based Platform for Digital Psychological Tools

The choice of the tools and programming technologies for information systems creation is relevant. For every projected system, it is necessary to define a number of criteria for development environment, used libraries an...

Designing a Multi Agent System Architecture for IT Governance Platform

This paper presents a multi-agents architecture which facilitates the integration of three major IT governance frameworks: COBIT5, ITIL V3 and ISO/IEC27002, to optimize the construction of a distributed system. This arch...

 Bins Formation using CG based Partitioning of Histogram Modified Using Proposed Polynomial

 This paper proposes a novel polynomial transform to modify the original histogram of the image to adjust the pixel density equally towards the high intensity levels so that uniform distribution of the pixels can be...

Analysis of Valuable Clustering Techniques for Deep Web Access and Navigation

A massive amount of content is available on web but huge portion of it is still invisible. User can only access this hidden web, also called Deep web, by entering a directed query in a web search form and thus accessing...

Analyzing Personality Traits and External Factors for Stem Education Awareness using Machine Learning

The purpose of the paper is to present the personality traits and the factors that influence a student to pursue STEM education using machine learning techniques. STEM courses have high regard because they play a vital r...

Download PDF file
  • EP ID EP550259
  • DOI 10.14569/IJACSA.2019.0100409
  • Views 110
  • Downloads 0

How To Cite

Hissah ALSaif, Taghreed Alotaibi (2019). Arabic Text Classification using Feature-Reduction Techniques for Detecting Violence on Social Media. International Journal of Advanced Computer Science & Applications, 10(4), 77-87. https://europub.co.uk/articles/-A-550259