Discovering Semantic and Sentiment Correlations using Short Informal Arabic Language Text

Abstract

Semantic and sentiment analysis have received a great deal of attention over the last few years because of the important role they play in many fields, including marketing, education, and politics. Social media has given researchers tremendous opportunities to collect huge amounts of data as input for semantic and sentiment analysis. Using the Twitter API, we collected around 4.5 million Arabic tweets and used them to propose a novel automatic unsupervised approach that captures patterns of words and sentences with similar contextual semantics and sentiment in informal Arabic, at both the word and sentence levels. We used a language modeling (LM) approach, a statistical model that can effectively estimate the distribution of natural language. In experiments, the proposed model performed better than the classic bigram and latent semantic analysis (LSA) models in most cases at the word level. To handle the large volume of data, we applied several text processing techniques and then removed unique words based on their relevance to the problem.

Authors and Affiliations

Salihah AlOtaibi, Muhammad Badruddin Khan

Keywords

Informal Arabic, Big Data, Sentiment analysis, Opinion Mining (OM), semantic analysis, bigram model, LSA model, Twitter

  • EP ID EP249793
  • DOI 10.14569/IJACSA.2017.080126

How To Cite

Salihah AlOtaibi, Muhammad Badruddin Khan (2017). Discovering Semantic and Sentiment Correlations using Short Informal Arabic Language Text. International Journal of Advanced Computer Science & Applications, 8(1), 198-207. https://europub.co.uk/articles/-A-249793