Isolated Automatic Speech Recognition of Quechua Numbers using MFCC, DTW and KNN

Abstract

Automatic Speech Recognition (ASR) is defined as the transformation of acoustic signals into strings of words. The field has been developed for many years and has made people's lives easier, so it has been implemented in several languages. However, ASR development remains very limited for languages that have few database resources despite being spoken by large populations. ASR development for the Quechua language is almost nonexistent, which isolates its culture and population from technology and information. In this work, an ASR system for isolated Quechua numbers is developed, in which Mel-Frequency Cepstral Coefficients (MFCC), Dynamic Time Warping (DTW) and K-Nearest Neighbor (KNN) methods are implemented using a database of recorded audio of the numbers one to ten in Quechua. The recordings that populate the database were uttered by native male and female speakers of Quechua. The recognition accuracy achieved in this research was 91.1%.
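The way DTW and KNN can be combined for isolated-word recognition may be sketched as follows. This is a minimal illustration, not the authors' implementation: it assumes each recording has already been converted into a sequence of feature vectors (for example MFCC frames), and the function names, the Euclidean frame distance, and the value k=3 are all assumptions, since the abstract does not state these details.

```python
import math
from collections import Counter


def frame_distance(a, b):
    """Euclidean distance between two feature vectors (assumed metric)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def dtw_distance(seq_a, seq_b):
    """Classic DTW cost between two sequences of feature vectors.

    cost[i][j] holds the cheapest alignment of the first i frames of
    seq_a with the first j frames of seq_b.
    """
    n, m = len(seq_a), len(seq_b)
    inf = float("inf")
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = frame_distance(seq_a[i - 1], seq_b[j - 1])
            # Extend the cheapest of the three allowed predecessor cells.
            cost[i][j] = d + min(cost[i - 1][j],      # frame of seq_a repeated
                                 cost[i][j - 1],      # frame of seq_b repeated
                                 cost[i - 1][j - 1])  # one-to-one match
    return cost[n][m]


def knn_classify(query, templates, k=3):
    """Label a query sequence by majority vote among its k DTW-nearest templates.

    templates is a list of (label, sequence) pairs; k=3 is an assumed value.
    """
    ranked = sorted(templates, key=lambda t: dtw_distance(query, t[1]))
    votes = Counter(label for label, _ in ranked[:k])
    return votes.most_common(1)[0][0]
```

In use, `templates` would hold the labeled training recordings for the ten numbers, and `knn_classify(query, templates)` would return the label of the spoken number. DTW is what allows utterances of different durations to be compared, since it stretches or compresses one sequence against the other before the distance is accumulated.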

Authors and Affiliations

Hernan Faustino Chacca Chuctaya, Rolfy Nixon Montufar Mercado, Jeyson Jesus Gonzales Gaona

  • EP ID EP406797
  • DOI 10.14569/IJACSA.2018.091003

How To Cite

Hernan Faustino Chacca Chuctaya, Rolfy Nixon Montufar Mercado, Jeyson Jesus Gonzales Gaona (2018). Isolated Automatic Speech Recognition of Quechua Numbers using MFCC, DTW and KNN. International Journal of Advanced Computer Science & Applications, 9(10), 24-29. https://europub.co.uk/articles/-A-406797