Speech Intelligibility Improvement in Noisy Environments for Near-End Listening Enhancement
Journal Title: Journal of Information Systems and Telecommunication - Year 2016, Vol 4, Issue 1
Abstract
A new speech intelligibility improvement method for near-end listening enhancement in noisy environments is proposed. This method improves speech intelligibility by optimizing energy correlation of one-third octave bands of clean speech and enhanced noisy speech without power increasing. The energy correlation is determined as a cost function based on frequency band gains of the clean speech. Interior-point algorithm which is an iterative procedure for the nonlinear optimization is used to determine the optimal points of the cost function because of nonlinearity and complexity of the energy correlation function. Two objective intelligibility measures, speech intelligibility index and short-time objective intelligibility measure, are employed to evaluate the noisy enhanced speech intelligibility. Furthermore, the speech intelligibility scores are compared with unprocessed speech and a baseline method under various noisy conditions. The results show large intelligibility improvements with the proposed method over the unprocessed noisy speech.
Authors and Affiliations
Peyman Goli, Mohammad Reza Karami-Mollaei
SRR shape dual band CPW-fed monopole antenna for WiMAX / WLAN applications
CPW structure is became common structure for UWB and multi band antenna design and SRR structure is well-known kind of metamaterial that has been used in antenna and filter design for multi band application. In this pape...
The Separation of Radar Clutters using Multi-Layer Perceptron
Clutter usually has negative influence on the detection performance of radars. So, the recognition of clutters is crucial to detect targets and the role of clutters in detection cannot be ignored. The design of radar det...
Pose-Invariant Eye Gaze Estimation Using Geometrical Features of Iris and Pupil Images
In the cases of severe paralysis in which the ability to control the body movements of a person is limited to the muscles around the eyes, eye movements or blinks are the only way for the person to communicate. Interface...
Fusion Infrared and Visible Images Using Optimal Weights
Image fusion is a process in which different images recorded by several sensors from one scene are combined to provide a final image with higher quality compared to each individual input image. In fact, combination of di...
Design of Fall Detection System: A Dynamic Pattern Approach with Fuzzy Logic and Motion Estimation
Every year thousands of the elderly suffer serious damages such as articular fractures, broken bones and even death due to their fall. Automatic detection of the abnormal walking in people, especially such accidents as t...