Analysis of the influence of sound signal processing parameters on the quality voice command recognition
Journal Title: Вісник НТУУ КПІ. Серія Радіотехніка, Радіоапаратобудування - Year 2014, Vol 0, Issue 56
Abstract
Introduction. For the task of voice control over different devices recognition of single (isolated) voice commands is required. Typically, this control method requires high reliability (at least 95% accuracy voice recognition). It should be noted that voice commands are often pronounced in high noisiness. All presently known methods and algorithms of speech recognition do not allow to clearly determine which parameters of sound signal can provide the best results. The main part. On the first level of voice recognition is about preprocessing and extracting of acoustic features that have a number of useful features – they are easily calculated, providing a compact representation of the voice commands that are resistant to noise interference; On the next level given command is looked for in the reference dictionary. To get MFCC coefficients input file has to be divided into frames. Each frame is measured by a window function and processed by discrete Fourier transform. The resulting representation of signal in the frequency domain is divided into ranges using a set of triangular filters. The last step is to perform discrete cosine transform. Method of dynamic time warping allows to get a value that is an inverse of degree of similarity between given command and a reference. Conclusions. Research has shown that in the field of voice commands recognition optimum results in terms of quality / performance can be achieved using the following parameters of sound signal processing:8 kHz sample rate, frame duration 70–120 ms, Hamming weighting function of a window, number of Fourier samples is 512.
Authors and Affiliations
L. Dyuzhayev, V. Koval
Power distribution in the electromagnetic field. Portions of energy flows. portions energy
New concepts of the phase velocity of the energy flux density, the phase velocity of the energy density, the concept of a portion of the energy flux density, portions of the energy density are introduced. The entered val...
Analysis of firmness of different strategies of optimization of analog circuits
The problem of designing of analog circuits is presented in the generalized formulation. Generalization consists in possibility of refusal of performance of laws of Kirchhoff during designing process at their uncondition...
Signal processing domain-acoustic processor in the autocorrelation mode
It is analyzed the signal / noise ratio, which can be achieved by the signal processing domain-acoustic processor. It is shown that the quality of signal processing by this method ideally determined by the quality of the...
Monolithic microwave duplexer
The results of development monolithic dielectric duplexer with the increased attenuation between channels are given, that is achieved asymmetric amplitude-frequency characteristics of the channel filters.
Comparison of Methods for Efficiency Indexes Estimation of Behavior Algorithms of Radioelectronic Complex Systems
Introduction. Nowadays it is actual task to provide the necessary efficiency indexes of radioelectronic complex system by its behavior algorithm design. There are several methods using for solving this task, intercompari...