Vietnamese Speech Command Recognition using Recurrent Neural Networks

Abstract

Voice control is an important function in many mobile devices and smart homes, and it is especially valuable in giving people with disabilities a convenient way to interact with devices. Although this problem has been studied widely around the world, there has been no formal study for the Vietnamese language. In addition, many existing studies do not offer a solution that can be easily extended in the future. In this study, a dataset of Vietnamese speech commands is labeled and organized so that it can be shared with the language research community in general and the Vietnamese language research community in particular. The paper also provides software for collecting and processing speech. Finally, the study designs and evaluates Recurrent Neural Networks and applies them to the collected data. The average recognition accuracy on a set of 15 commands for controlling smart home devices is 98.19%.

Authors and Affiliations

Phan Duy Hung, Truong Minh Giang, Le Hoang Nam, Phan Minh Duong

  • EP ID EP611261
  • DOI 10.14569/IJACSA.2019.0100728

How To Cite

Phan Duy Hung, Truong Minh Giang, Le Hoang Nam, Phan Minh Duong (2019). Vietnamese Speech Command Recognition using Recurrent Neural Networks. International Journal of Advanced Computer Science & Applications, 10(7), 194-201. https://europub.co.uk/articles/-A-611261