Emotion Recognition from Speech using Prosodic and Linguistic Features

Abstract

Speech signal can be used to extract emotions. However, it is pertinent to note that variability in speech signal can make emotion extraction a challenging task. There are a number of factors that indicate presence of emotions. Prosodic and temporal features have been used previously for the purpose of identifying emotions. Separately, prosodic/temporal and linguistic features of speech do not provide results with adequate accuracy. We can also find out emotions from linguistic features if we can identify contents. Therefore, We consider prosodic as well as temporal or linguistic features which help increasing accuracy of emotion recognition, which is our first contribution reported in this paper. We propose a two-step model for emotion recognition; we extract emotions based on prosodic features in the first step. We extract emotions from word segmentation combined with linguistic features in the second step. While performing our experiments, we prove that the classification mechanisms, if trained without considering age factor, do not help improving accuracy. We argue that the classifier should be based on the age group on which the actual emotion extraction be required, and this becomes our second contribution submitted in this paper.

Authors and Affiliations

Mahwish Pervaiz, Tamim Khan

Keywords

Related Articles

Detection and Removal of Gray, Black and Cooperative Black Hole Attacks in AODV Technique

Mobile ad hoc network (MANET) is an autonomous self-configuring infrastructure-less wireless network. MANET is vulnerable to a lot of routing security threats due to unreliability of its nodes that are highly involved in...

Online Monitoring System Design of Intelligent Circuit Breaker Based on DSP and ARM

In order to accurately analyze the dynamic characteristics of the vacuum circuit breaker, a dual-core master-slave processor structure for online monitoring system based on DSP and ARM is proposed. This structure consist...

Intrusion Detection System in Wireless Sensor Networks: A Review

The security of wireless sensor networks is a topic that has been studied extensively in the literature. The intrusion detection system is used to detect various attacks occurring on sensor nodes of Wireless Sensor Netwo...

An Evaluation Model for Auto-generated Cognitive Scripts

Autonomous intelligent agents have become a very important research area in Artificial Intelligence (AI). Socio-cultural situations are one challenging area in which autonomous intelligent agents can acquire new knowledg...

Novel Conception of a Tunable RF MEMS Resonator

This paper presents a new monolithic microwave integrated circuit (MMIC) based on coplanar waveguide (CPW) design for a tunable resonator based on RF MEMS. This RF structure, which can be used for system on chip (SOC), i...

Download PDF file
  • EP ID EP96329
  • DOI 10.14569/IJACSA.2016.070813
  • Views 79
  • Downloads 0

How To Cite

Mahwish Pervaiz, Tamim Khan (2016). Emotion Recognition from Speech using Prosodic and Linguistic Features. International Journal of Advanced Computer Science & Applications, 7(8), 84-90. https://europub.co.uk/articles/-A-96329