Helpful Statistics in Recognizing Basic Arabic Phonemes
Journal Title: International Journal of Advanced Computer Science & Applications - Year 2017, Vol 8, Issue 2
Abstract
The recognition of continuous speech is one of the main challenges in the building of automatic speech recognition (ASR) systems, especially when it comes to phonetically complex languages such as Arabic. An ASR system seems to be actually in a blocked alley. Nearly all solutions follow the same general model. The previous research focused on enhancing its performance by incorporating supplementary features. This paper is part of ongoing research efforts aimed at developing a high-performance Arabic speech recognition system for learning and teaching purposes. It investigates a statistical analysis of certain distinctive features of the basic Arabic phonemes which seems helpful in enhancing the performance of a baseline HMM-based ASR system. The statistics are collected using a particular Arabic speech database, which involves ten different male speakers and more than eight hours of speech which covers all Arabic phonemes. In HMM modeling framework, the statistics provided are helpful in establishing the appropriate number of HMM states for each phoneme and they can also be utilized as an initial condition for the EM estimation procedure, which generally, accelerates the estimation process and, thus, improves the performance of the system. The obtained findings are presented and possible applications of automatic speech recognition and speaker identification systems are also suggested.
Authors and Affiliations
Mohamed O. M. Khelifa, Yousfi Abdellah, Yahya O. M. ElHadj, Mostafa Belkasmi
Development and Evaluation of Massive Open Online Course (MOOC) as a Supplementary Learning Tool: An Initial Study
The popularity of Massive Open Online Courses (MOOCs) is prevalent among researchers and practitioners as a new paradigm of open education resource. Since the development of this technology may entail enormous investment...
Automatic Keyphrase Extractor from Arabic Documents
The keyphrase is a sentence or a part of a sentence that contains a sequence of words that expresses the meaning and the purpose of any given paragraph. Keyphrase extraction is the task of identifying the possible keyphr...
An Efficient Method For Multichannel Wireless Mesh Networks With Pulse Coupled Neural Network
Multi cast communication is a key technology for wireless mesh networks. Multicast provides efficient data distribution among a group of nodes, Generally sensor networks and MANETs uses multicast algorithms which...
Automated Greenhouses for the Reduction of the Cost of the Family Basket in the District of Villa El Salvador-Perú
Today, the cost of the family basket is gradually increasing, not only globally but also in our country. This increase includes the demand for vegetables and fresh vegetables that allow people to improve their quality of...
Flood Analysis in Peru using Satellite Image: The Summer 2017 Case
At the beginning of the year 2017, different regions of Peru suffered from heavy rains mainly due to the 'El Niño' and 'La Niña' phenomena. As a result of these massive storms, several cities were affected by overflows a...