Acoustic Model Training, using Kaldi, for Automatic Whispery Speech Recognition

Journal Title: Annals of Computer Science and Information Systems - Year 2018, Vol 16, Issue

Abstract

The article presents research on the automatic whispery speech recognition. The main task was to find dependences between a number of triphone classes (number of leaves in decision tree) and the total number of Gaussian distributions and therefore, to determine optimal values, for which the quality of speech recognition is best. Moreover, it was found, how these dependences differ between normal and whispery speech, what was not done earlier, and this is the innovative part of this work. Based on the performed experiments and obtained results one can say that the number of triphone classes (number of leaves) for whispered speech should be significantly lower than for normal speech.

Authors and Affiliations

Piotr Kozierski, Talar Sadalla, Szymon Drgas, Adam Dąbrowski, Joanna Ziętkiewicz, Wojciech Giernacki

Keywords

Related Articles

A Detailed Study of EEG based Brain Computer Interface

Brain Computer Interface (BCI) generate a direct method to communicate with the outside world. Many patients are not able to communicate. For example:- the patient who are suffered with the several disease like post stro...

"Passeport Vacances": an assignment problem with cost balancing

asseport Vacances is an offer for school-aged children to discover a set of activities during holidays. For more than 30 years, it has been an established social function in several countries, including Germany and Switz...

Development of a mathematical model for electrode systems in rheoophthalmography

The problem of estimating the electrical impedance characteristics was solved using the system of impedance diagnostics of blood circulation with the help of mathematical modeling. In this work, the geometry for mathemat...

Robotic Process Automation of Unstructured Data with Machine Learning

In this paper we present our work in progress on building an artificial intelligence system dedicated to tasks regarding the processing of formal documents used in various kinds of business procedures. The main challenge...

An Approach towards economical hierarchic Search over Encrypted Cloud

In display, Cloud registering is the prevailing area in data innovation. With expanded value of information outsourcing of cloud information protection of delicate information turns into a major issue. For the security r...

Download PDF file
  • EP ID EP568223
  • DOI 10.15439/2018F255
  • Views 34
  • Downloads 0

How To Cite

Piotr Kozierski, Talar Sadalla, Szymon Drgas, Adam Dąbrowski, Joanna Ziętkiewicz, Wojciech Giernacki (2018). Acoustic Model Training, using Kaldi, for Automatic Whispery Speech Recognition. Annals of Computer Science and Information Systems, 16(), 109-114. https://europub.co.uk/articles/-A-568223