Opinion Mining: An Approach to Feature Engineering

Abstract

Sentiment Analysis or opinion mining refers to a process of identifying and categorizing the subjective information in source materials using natural language processing (NLP), text analytics and statistical linguistics. The main purpose of opinion mining is to determine the writer’s attitude towards a particular topic under discussion. This is done by identifying a polarity of a particular text paragraph using different feature sets. Feature engineering in pre-processing phase plays a vital role in improving the performance of a classifier. In this paper we empirically evaluated various features weighting mechanisms against the well-established classification techniques for opinion mining, i.e. Naive Bayes-Multinomial for binary polarity cases and SVM-LIN for multiclass cases. In order to evaluates these classification techniques we use Rotten Tomatoes publically available movie reviews dataset for training the classifiers as this is widely used dataset by research community for the same purpose. The empirical experiment concludes that the feature set containing noun, verb, adverb and adjective lemmas with feature-frequency (FF) function perform better among all other feature settings with 84% and 85% correctly classified test instances for Naïve Bayes and SVM, respectively.

Authors and Affiliations

Shafaq Siddiqui, M. Abdul Rehman, Sher M. Daudpota, Ahmad Waqas

Keywords

Related Articles

Scrum Method Implementation in a Software Development Project Management

To maximize the performance, companies conduct a variety of ways to increase the business profit. The work management between one company and the other company is different, so the differences in the management may cause...

Cardiotocographic Diagnosis of Fetal Health based on Multiclass Morphologic Pattern Predictions using Deep Learning Classification

Medical complications of pregnancy and pregnancy-related deaths continue to remain a major global challenge today. Internationally, about 830 maternal deaths occur every day due to pregnancy-related or childbirth-related...

Automatic Sign Language Recognition: Performance Comparison of Word based Approach with Spelling based Approach

Evolution of computer based interaction has been through a number of phases. From command line interface to menu driven environment to Graphics User Interface, the communication has evolved to a better user friendly envi...

OPTIMIZING THE USE OF AN SPI FLASH PROM IN MICROBLAZE-BASED EMBEDDED SYSTEMS

This paper aims to simplify FPGA designs that incorporate Embedded Software Systems using a soft core Processor. It describes a simple solution to reduce the need of multiple non-volatile memory devices by using one SPI...

ABJAD Arabic-Based Encryption

The researcher introduced an enhanced classical Arabic-based encryption technique that is essentially designed for Arab nations. The new algorithm uses the shared key technique where the Keyword system Modulus is employe...

Download PDF file
  • EP ID EP498437
  • DOI 10.14569/IJACSA.2019.0100320
  • Views 91
  • Downloads 0

How To Cite

Shafaq Siddiqui, M. Abdul Rehman, Sher M. Daudpota, Ahmad Waqas (2019). Opinion Mining: An Approach to Feature Engineering. International Journal of Advanced Computer Science & Applications, 10(3), 159-165. https://europub.co.uk/articles/-A-498437