Upgrading the Performance of Speech Emotion Recognition at the Segmental Level

Journal Title: IOSR Journals (IOSR Journal of Computer Engineering) - Year 2013, Vol 15, Issue 3

Abstract

This paper presents an efficient approach for maximizing the accuracy of automatic speech emotion recognition in English using minimal inputs, minimal features, lower algorithmic complexity and reduced processing time. The findings reported here are based exclusively on vowel formants, whereas most related previous work used tens or even hundreds of other features. Despite that greater level of signal processing, the recognition accuracy reported earlier was often lower than that obtained by our approach. The method is based on vowel utterances. The first step is statistical pre-processing of the vowel formants, followed by identification of the best formants using the K-means, K-nearest neighbor and Naive Bayes classifiers. The artificial neural network used for the final classification gave an accuracy of 95.6% on elicited emotional speech. Nearly 1500 speech files from ten female speakers, covering the neutral state and six basic emotions, were used to demonstrate the efficiency of the proposed approach. Such a result has not been reported earlier for English and is of significance to researchers, sociologists and others interested in speech.
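The formant-selection step described in the abstract can be illustrated with a minimal sketch. It is not the author's implementation: it covers only the K-nearest neighbor classifier (one of the three named), scores each formant by leave-one-out accuracy (an assumed evaluation protocol), and runs on invented toy values for F1, F2 and F3. The names `SAMPLES`, `knn_predict`, `loo_accuracy` and `rank_formants` are hypothetical.

```python
from collections import Counter

# Hypothetical toy data: (F1, F2, F3) in Hz plus an emotion label.
# These values are invented for illustration and are NOT from the paper.
SAMPLES = [
    ((700, 1200, 2500), "neutral"),
    ((720, 1250, 2900), "neutral"),
    ((710, 1220, 2700), "neutral"),
    ((900, 1800, 2600), "anger"),
    ((920, 1850, 2800), "anger"),
    ((910, 1780, 3000), "anger"),
    ((500, 2300, 2650), "sadness"),
    ((520, 2350, 2950), "sadness"),
    ((510, 2280, 2750), "sadness"),
]


def knn_predict(train, query, k, dims):
    """Majority vote among the k training samples closest to `query`,
    using squared Euclidean distance over the feature indices in `dims`."""
    ranked = sorted(
        train,
        key=lambda s: sum((s[0][d] - query[d]) ** 2 for d in dims),
    )
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]


def loo_accuracy(samples, k, dims):
    """Leave-one-out accuracy: classify each sample from all the others."""
    hits = 0
    for i, (features, label) in enumerate(samples):
        train = samples[:i] + samples[i + 1:]
        if knn_predict(train, features, k, dims) == label:
            hits += 1
    return hits / len(samples)


def rank_formants(samples, k=1):
    """Score each formant on its own and sort from best to worst."""
    names = ["F1", "F2", "F3"]
    scores = {names[d]: loo_accuracy(samples, k, (d,)) for d in range(3)}
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


if __name__ == "__main__":
    for name, accuracy in rank_formants(SAMPLES):
        print(f"{name}: leave-one-out accuracy {accuracy:.2f}")
```

In this toy data F1 and F2 separate the three classes while F3 does not, so the ranking puts F3 last. A fuller sketch would repeat the scoring with K-means and Naive Bayes and pass the selected formants to a neural network, as the abstract outlines.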

Authors and Affiliations

Agnes Jacob

How To Cite

Agnes Jacob (2013). Upgrading the Performance of Speech Emotion Recognition at the Segmental Level. IOSR Journals (IOSR Journal of Computer Engineering), 15(3), 48-52. https://europub.co.uk/articles/-A-104707