Hierarchical Classifiers for Multi-Way Sentiment Analysis of Arabic Reviews

Abstract

Sentiment Analysis (SA) is one of hottest fields in data mining (DM) and natural language processing (NLP). The goal of SA is to extract the sentiment conveyed in a certain text based on its content. While most current works focus on the simple problem of determining whether the sentiment is positive or negative, Multi-Way Sentiment Analysis (MWSA) focuses on sentiments conveyed through a rating or scoring system (e.g., a 5-star scoring system). In such scoring systems, the sentiments conveyed in two reviews of close scores (such as 4 stars and 5 stars) can be very similar creating an added challenge compared to traditional SA. One intuitive way of handling this challenge is via a divide-and-conquer approach where the MWSA problem is divided into a set of sub-problems allowing the use of customized classifiers to differentiate between reviews of close scores. A hierarchical classification structure can be used with this approach where each node represents a different classification sub-problem and the decision from it may lead to the invocation of another classifier. In this work, we show how the use of this divide-and-conquer hierarchical structure of classifiers can generate better results than the use of existing flat classifiers for the MWSA problem. We focus on the Arabic language for many reasons such as the importance of this language and the scarcity of prior works and available tools for it. To the best of our knowledge, very few papers have been published on MWSA of Arabic reviews. One notable work is that of Ali and Atiya, in which the authors collected a large scale Arabic Book Reviews (LABR) dataset and made it publicly available. Unfortunately, the baseline experiments on this dataset had very low accuracy. We present two different hierarchical structures and compare their accuracies with the flat structure using different core classifiers. The comparison is based on standard accuracy measures such as precision and recall in addition to using the mean squared error (MSE) as a more accurate measure given the fact that not all misclassifications are the same. The results show that, in general, hierarchical classifiers give significant improvements (of more than 50% in certain cases) over flat classifiers.

Authors and Affiliations

Mahmoud Al-Ayyoub, Aya Nuseir, Ghassan Kanaan, Riyad Al-Shalabi

Keywords

Related Articles

A Novel Approach for Background Subtraction using Generalized Rayleigh Distribution

Identification of the foreground objects in dynamic scenario video images is an exigent task, when compared to static scenes. In contrast to motionless images, video sequences offer more information concerning how items...

A Comparative Study of the Iterative Numerical Methods Used in Mine Ventilation Networks

Ventilation is one of the key safety tasks in underground mines. Determination of the airflow through mine openings and ducts is complex and often requires the application of numerical analysis. The governing equations u...

A Novel Rule-Based Root Extraction Algorithm for Arabic Language

Non-vocalized Arabic words are ambiguous words, because non-vocalized words may have different meanings. Therefore, these words may have more than one root. Many Arabic root extraction algorithms have been conducted to e...

Cloud Security based on the Homomorphic Encryption

Cloud computing provides services rather than products; where it offers many benefits to clients who pay to use hardware and software resources. There are many advantages of using cloud computing such as low cost, easy t...

A New Parallel Matrix Multiplication Algorithm on Tree-Hypercube Network using Iman1 Supercomputer

The tree-hypercube (TH) interconnection network is relatively a new interconnection network, which is constructed from tree and hypercube topologies. TH is developed to support parallel algorithms for solving computation...

Download PDF file
  • EP ID EP101370
  • DOI 10.14569/IJACSA.2016.070269
  • Views 107
  • Downloads 0

How To Cite

Mahmoud Al-Ayyoub, Aya Nuseir, Ghassan Kanaan, Riyad Al-Shalabi (2016). Hierarchical Classifiers for Multi-Way Sentiment Analysis of Arabic Reviews. International Journal of Advanced Computer Science & Applications, 7(2), 531-539. https://europub.co.uk/articles/-A-101370