Multi-class sentiment analysis using a hierarchical logistic model tree approach
Journal Title: MASKANA - Year 2014, Vol 5, Issue 5
Abstract
This paper proposes a new hybrid system for multi-class sentiment analysis based on General Inquirer (GI) dictionary and a hierarchical Logistic Model Tree (LMT) approach. This new system consists of three layers, the Bipolar Layer (BL) is of one LMT (LMT-1) for classifying sentiment polarity, while the Intensity Layer (IL) comprises two LTMs (LMT-2 and LMT3) for detecting separately three positive and three negative sentiment intensities. Only in construction phase, the Grouping Layer (GL) is used to cluster positive and negative instances by employing 2 k-means respectively. In Preprocessing phase, the raw text data is subjected to a tokenizer, a tagger, a stemmer and finally to GI dictionary to count and label only verbs, nouns, adjectives and adverbs with 24 markers that are used later to compute feature vectors. In Sentiments Classification phase, feature vectors are first introduced to LMT-1, then they are grouped in GL according to class label, afterward these groups of instances are labeled manually, and finally positive instances are introduced to LMT-2 and negative instances to LMT-3. The three trees are trained and tested on Movie Review and SenTube datasets utilizing 10- folds stratified cross validation. LMT-1 yields a tree of 48 leaves and 95 of size with 90.88% of accuracy, while both LMT-2 and LMT-3 provide two trees of 1 leaf and 1 of size with 99.28% and 99.37% of accuracy respectively. Experiments show that the proposed hierarchical classification methodology gives a better performance compared to other prevailing approaches.
Authors and Affiliations
Masun Nabhan Homsi
La clasificación de universidades como herramienta de gestión universitaria
El reciente debate sobre la clasificación de universidades ha producido preocupación entre los miembros de la comunidad académica ecuatoriana. El que las universidades ecuatorianas no produzcan ganadores de Premios Nobel...
Caracterización de señales sísmicas del Volcán Cotopaxi utilizando estimadores espectrales clásicos y de máxima entropía
Se presenta un estudio de detección y caracterización de eventos sísmicos del tipo volcano tectónicos y largo periodo de registros sísmicos generados por el volcán Cotopaxi. La estructura secuencial de detección propue...
A preliminary response from the Faculty of Psychology students of the University of Cuenca to the modified EFL teaching approach
English teachers in Ecuadorian universities, like teachers in many non-native English-speaking countries, face the challenge of dealing with uninterested, unmotivated students, even when intermediate proficiency of Eng...
Efecto de un tranquilizante sobre las características seminales de toros colectados con electroeyaculador
Entre las técnicas actuales para la colecta de semen de especímenes bovinos están el uso de una vagina artificial (AV), el electroeyaculador (EE) y el masaje transrectal (MTR) de las glándulas sexuales accesorias. El u...
Un enfoque para la integración de dispositivos IoT en el desarrollo de SIG en la nube
Cloud computing and Internet of Things (IoT) as technological support to the construction of Geographic Information Systems (GIS) is changing the way these systems are developed and employed. The technological infrastr...