Analysis of the influence of sound signal processing parameters on the quality voice command recognition

Abstract

Introduction. For the task of voice control over different devices recognition of single (isolated) voice commands is required. Typically, this control method requires high reliability (at least 95% accuracy voice recognition). It should be noted that voice commands are often pronounced in high noisiness. All presently known methods and algorithms of speech recognition do not allow to clearly determine which parameters of sound signal can provide the best results. The main part. On the first level of voice recognition is about preprocessing and extracting of acoustic features that have a number of useful features – they are easily calculated, providing a compact representation of the voice commands that are resistant to noise interference; On the next level given command is looked for in the reference dictionary. To get MFCC coefficients input file has to be divided into frames. Each frame is measured by a window function and processed by discrete Fourier transform. The resulting representation of signal in the frequency domain is divided into ranges using a set of triangular filters. The last step is to perform discrete cosine transform. Method of dynamic time warping allows to get a value that is an inverse of degree of similarity between given command and a reference. Conclusions. Research has shown that in the field of voice commands recognition optimum results in terms of quality / performance can be achieved using the following parameters of sound signal processing:8 kHz sample rate, frame duration 70–120 ms, Hamming weighting function of a window, number of Fourier samples is 512.

Authors and Affiliations

L. Dyuzhayev, V. Koval

Keywords

Related Articles

Мethods of reduction of the peak-factor in channels with OFDM.

In given article the basic methods of reduction of the peak-factor in systems which use OFDM signals are considered.

Student’s search for scientific and technical information

The possibilities that exist in the National Technical University of Ukraine "Kyiv Polytechnic Institute" for access to scientific and technical information in the fields of radio engineering and electronics are consider...

Spectrum of radiation of the partial discharge in the dielectrics

In the article the analysis of the method of study the spectrum of partial discharge in high voltage equipment

Origin and development of scientific method

In the article short-story history of origin and development of scientific method of cognition of nature is presented from the most ancient times to our days. It is marked that he was folded during great while and only a...

Device for the increase of duration of radio signals

The device for The increase of duration of radio signals is considered by the method of phase concordance of frequency of filling due to changeable time of delay for the increase of measuring exactness. The results of ex...

Download PDF file
  • EP ID EP308954
  • DOI 10.20535/RADAP.2014.56.34-41
  • Views 84
  • Downloads 0

How To Cite

L. Dyuzhayev, V. Koval (2014). Analysis of the influence of sound signal processing parameters on the quality voice command recognition. Вісник НТУУ КПІ. Серія Радіотехніка, Радіоапаратобудування, 0(56), 34-41. https://europub.co.uk/articles/-A-308954