Optimizing the Hyperparameter of Feature Extraction and Machine Learning Classification Algorithms

Abstract

The process of assigning a quantitative value to a piece of text expressing a mood or effect is called Sentiment analysis. Comparison of several machine learning, feature extraction approaches, and parameter optimization was done to achieve the best accuracy. This paper proposes an approach to extracting comparison value of sentiment review using three features extraction: Word2vec, Doc2vec, Terms Frequency-Inverse Document Frequency (TF-IDF) with machine learning classification algorithms, such as Support Vector Machine (SVM), Naive Bayes and Decision Tree. Grid search algorithm is used to optimize the feature extraction and classifier parameter. The performance of these classification algorithms is evaluated based on accuracy. The approach that is used in this research succeeded to increase the classification accuracy for all feature extractions and classifiers using grid search hyperparameter optimization on varied pre-processed data.

Authors and Affiliations

Sani Muhammad Isa, Rizaldi Suwandi, Yosefina Pricilia Andrean

Keywords

Related Articles

Enhancing Visualization of Multidimensional Data by Ordering Parallel Coordinates Axes

Every year business is overwhelmed by the quantity and variety of data. Visualization of Multi-dimensional data is counter-intuitive using conventional graphs. Parallel coordinates are proposed as an alternative to explo...

Privacy Impacts of Data Encryption on the Efficiency of Digital Forensics Technology

Owing to a number of reasons, the deployment of encryption solutions are beginning to be ubiquitous at both organizational and individual levels. The most emphasized reason is the necessity to ensure confidentiality of p...

Investigate the use of Anchor-Text and of Query-Document Similarity Scores to Predict the Performance of Search Engine

Query difficulty prediction aims to estimate, in advance, whether the answers returned by search engines in response to a query are likely to be useful. This paper proposes new predictors based upon the similarity betwee...

Creating a Knowledge Database for Lectures of Faculty Members, Proposed E-Module for Isra University

Higher education in Jordan is currently expanding as new universities open and compete for offering the best learning experience. Many universities face accreditation challenges, hence, they attend to recruit lecturers w...

Cost-effective and Green Manufacturing Substrate Integrated Waveguide (SIW) BPF for Wireless Sensor Network Applications

This paper presents a comparison between innovative technique for implementation of substrate integrated waveguide band pass filter centered at 4 GHz and conventional PCB results . Two poles filter is designed, simulated...

Download PDF file
  • EP ID EP498383
  • DOI 10.14569/IJACSA.2019.0100309
  • Views 115
  • Downloads 0

How To Cite

Sani Muhammad Isa, Rizaldi Suwandi, Yosefina Pricilia Andrean (2019). Optimizing the Hyperparameter of Feature Extraction and Machine Learning Classification Algorithms. International Journal of Advanced Computer Science & Applications, 10(3), 69-76. https://europub.co.uk/articles/-A-498383