Local Feature based Gender Independent Bangla ASR
Journal Title: International Journal of Advanced Research in Artificial Intelligence(IJARAI) - Year 2012, Vol 1, Issue 8
Abstract
This paper presents an automatic speech recognition (ASR) for Bangla (widely used as Bengali) by suppressing the speaker gender types based on local features extracted from an input speech. Speaker-specific characteristics play an important role on the performance of Bangla automatic speech recognition (ASR). Gender factor shows adverse effect in the classifier while recognizing a speech by an opposite gender, such as, training a classifier by male but testing is done by female or vice-versa. To obtain a robust ASR system in practice it is necessary to invent a system that incorporates gender independent effect for particular gender. In this paper, we have proposed a Gender-Independent technique for ASR that focused on a gender factor. The proposed method trains the classifier with the both types of gender, male and female, and evaluates the classifier for the male and female. For the experiments, we have designed a medium size Bangla (widely known as Bengali) speech corpus for both the male and female.The proposed system has showed a significant improvement of word correct rates, word accuracies and sentence correct rates in comparison with the method that suffers from gender effects using. Moreover, it provides the highest level recognition performance by taking a fewer mixture component in hidden Markov model (HMMs).
Authors and Affiliations
Bulbul Ahamed , Khaled Mahmud , B. K. M. Mizanur Rahman , Foyzul Hassan , Rasel Ahmed , Mohammad Nurul Huda
Static Gesture Recognition Combining Graph and Appearance Features
In this paper we propose the combination of graph-based characteristics and appearance-based descriptors such as detected edges for modeling static gestures. Initially we convolve the original image with a Gaussian...
Relations between Psychological Status and Eye Movements
Relations between psychological status and eye movements are found through experiments with readings of different types of documents as well as playing games. Psychological status can be monitored with Electroencep...
Automatic Recognition of Human Parasite Cysts on Microscopic Stools Images using Principal Component Analysis and Probabilistic Neural Network
Parasites live in a host and get its food from or at the expensive of that host. Cysts represent a form of resistance and spread of parasites. The manual diagnosis of microscopic stools images is time-consuming and...
Evacuation Path Selection for Firefighters Based on Dynamic Triangular Network Model
Path selection is one of the critical aspects in emergency evacuation. In a fire scene, how to choose an optimal evacuation path for firefighters is a challenging aspect. In this paper, firstly, a dynamic triangular netw...
Changes in Known Statements After New Data is Added
Learning spaces are broadly defined as spaces with a noteworthy bearing on learning. They can be physical or virtual, as well as formal and informal. The formal ones are customary understood to be traditional class...