Development of the method of automatic determination of the speaker gender on the basis of joint evaluation of frequency moments of basic tons and formant frequencies
Journal Title: Технологический аудит и резервы производства - Year 2018, Vol 3, Issue 2
Abstract
<p><em>The object of research is the methods of recognizing the speaker gender by means of speech signals. One of the most problematic places is insufficient knowledge of the choice of signs and decisive rules. This is necessary to increase the probability of correct recognition and noise immunity of gender recognition by voice signals in conditions of interference. It is also important to simplify the implementation of algorithms for recognizing the speaker gender.</em></p><p><em>For recognition of the speaker gender, a new set of classification characteristics is selected, including the joint use of estimates of the average value of the pitch frequency, its kurtosis coefficient, estimates of the mean values of the formants and their asymmetry coefficients. In the course of the research, the method of statistical testing of the proposed algorithms on a personal computer is used. The experiments are carried out using real audio signals input from a microphone into a personal computer for both female and male representatives, and recorded as separate files. For this purpose, 10 standards of 10 words are used for each of the 5 female speakers and 5 male speakers.</em></p><p><em>Based on the results of statistical tests for an algorithm involving the joint use of estimates of the mean value of the pitch frequency, its kurtosis coefficient, estimates of the mean values of the formants and their asymmetry coefficients, an average probability of correct recognition is obtained 1. With the additional action of additive noise of the Gaussian type, white noise and the ratio of the signal/noise q=20, for such algorithm the probability of correct recognition is experimentally obtained – 0.8. For the decision algorithm, which uses only estimates of the average value of the pitch frequency and its kurtosis coefficient, an average probability of correct recognition is estimated at 0.9. This indicates more noise immunity of such algorithms.</em></p><p><em>In the future, the use of the obtained results not only for Russian and Ukrainian languages, but also for a number of foreign languages is supposed.</em></p>
Authors and Affiliations
Sergey Omelchenko
Selection of horizon for forecasting innovative development of industrial enterprise
<p><em>The object of research is a complex self-regulating socio-economic meso-level system: enterprise. The article deals with industrial enterprise as a complex self-regulating management system of socio-economic facto...
Substatiation of quantitative criteria of structural parts and units manufacturability evaluation
<p><em>The object of research is assemblability and maintainability of structures. Criteria for assessing such important parameters of design manufacturability are an extremely complex problem in the design process. The...
Application of numerical simulation methods for reduction of aircrafts ice protection systems energy consumption
<p><em>The object of research is the processes of hydrodynamic and heat and mass transfer occurring during the icing of aircraft during flight in adverse meteorological conditions, as well as the system of protection aga...
Development of the design method of the enterprise for the release of new products
<p><em>The object of research is the method of designing an enterprise for the release of new products, the basis of which is played with nature and the method of analyzing hierarchies. The sustainability of the project...
The use of audit as a tool for strategic control of marketing activities of Polish enterprises
<p><em>The paper is focused on marketing audit within the marketing system control. Marketing audit is one of the tools applied to assess and improve the use of marketing in corporate activity.</em></p><p><em>One of the...