A COMPATIVE STUDY OF SILENCE AND NON SILENCE REGIONS OF SPEECH SIGNAL USING PROSODY FEATURES FOR EMOTION RECOGNITION

Journal Title: Indian Journal of Computer Science and Engineering - Year 2016, Vol 7, Issue 4

Abstract

The objective of this work is the comparative study of the speech signal between the silence and non- silence regions of the speech signal. In this work our main goal is to observe the pitch contour, energy and duration are time-varying and also study how these changes play an important role in emotion recognition. An important step in emotion recognition from speech is to select significant features which carry large emotional information about the speech signal. It was given that emotion recognition from speech has different types of features, among them is prosody, spectral and acoustic features. Sometimes prosody features are called supra-segmental features. It deals with the auditory qualities of the sound and it can also reflect aspects of meaning, intention and emotional state of the characters [1] [2].Prosody Feature consists of more pitch information which is used in identifying the emotion such as Pitch, Energy, and Duration. In this work we also explored the importance of the speech signal which doesn’t have silence regions and how the signal varies due to of pitch contour, energy, duration values, to analyze their contribution towards the recognition of emotion. The main intention of this work was to utilize the speech properly by means of actual speech content, i.e., with no other silence parts or unnecessary parts of the signal.

Authors and Affiliations

J. Naga Padmaja , R. Rajeswara Rao

Keywords

Related Articles

AN IMPROVED DOMAIN CLASSIFICATION SCHEME BASED ON LOCAL FRACTAL DIMENSION

In fractal image compression, most of the time during encoding is spent for finding the best matching pair of range-domain blocks. Different techniques have been analyzed for decreasing the number of operations required...

REVIEW ANALYSIS OF THE ROUTING PROTOCOLS IN WIRELESS SENSOR NETWORKS FOR ENERGY OPTIMIZATION

Wireless sensor network consists of number of sensors, which collects the information and send to the sink node. Sensor node has limited energy storage and cannot be replaced in certain applications. A significant work h...

Analysis of Ternary and Binary High Resolution Codes Using MATLAB

It is feasible to achieve simultaneously superior performances in detection range and range resolution using the proposed Chebyshev mapping based binary and ternary codes. The performance parameter for the high resolutio...

Handling Uncertain and Ambiguous Spatial Expressions in Text Using Fuzzy Logic

The knowledge era, as this era is called, poses the challenge of churning knowledge from the pool of information available from various sources. Much of the text documents acting as an information source and the query po...

LIGHT WEIGHT SECURITY AND AUTHENTICATION IN WIRELESS BODY AREA NETWORK

In recent year, the increasing number of wearable sensors on human can serve for many purposes like emergency care, health care remote monitoring, personal entertainment and communication etc. The healthcare application...

Download PDF file
  • EP ID EP144202
  • DOI -
  • Views 107
  • Downloads 0

How To Cite

J. Naga Padmaja, R. Rajeswara Rao (2016). A COMPATIVE STUDY OF SILENCE AND NON SILENCE REGIONS OF SPEECH SIGNAL USING PROSODY FEATURES FOR EMOTION RECOGNITION. Indian Journal of Computer Science and Engineering, 7(4), 153-161. https://europub.co.uk/articles/-A-144202