An Improved Approach for Text-Independent Speaker Recognition

Abstract

This paper presents new Speaker Identification and Speaker Verification systems based on the use of new feature vectors extracted from the speech signal. The proposed structure combine between the most successful Mel Frequency Cepstral Coefficients and new features which are the Short Time Zero Crossing Rate of the signal. A comparison between speaker recognition systems based on Gaussian mixture models using the well known Mel Frequency Cepstral Coefficients and the novel systems based on the use of a combination between both reduced Mel Frequency Cepstral Coefficients features vectors and Short Time Zero Crossing Rate features is given. This comparison proves that the use of the new reduced feature vectors help to improve the system’s performance and also help to reduce the time and memory complexity of the system which is required for realistic applications that suffer from computational resource limitation. The experiments were performed on speakers from TIMIT database for different training durations. The suggested systems performances are evaluated against the baseline systems. The increase of the proposed systems performances are well observed for identification experiments and the decrease of Equal Error Rates are also remarkable for verification experiments. Experimental results demonstrate the effectiveness of the new approach which avoids the use of more complex algorithms or the combination of different approaches requiring lengthy calculation.

Authors and Affiliations

Rania Chakroun, Leila Zouari, Mondher Frikha

Keywords

Related Articles

An approach for Teaching of National Languages and Cultures through ICT in Cameroon

This article describes the input of ICT to the modernization of teaching national languages and cultures in order to promote cultural diversity as well as dissemination of scientific knowledge through national languages....

Internet Forensics Framework Based-on Clustering

Internet network attacks are complicated and worth studying. The attacks include Denial of Service (DoS). DoS attacks that exploit vulnerabilities found in operating systems, network services and applications. Indicators...

Features and Potential Security Challenges for IoT Enabled Devices in Smart City Environment

Introduction of Internet of Things in our lives have brought drastic changes in the social norms, working habits, ways of completing tasks and planning for future. Data about our interactions with everyday objects can be...

The Design and Evaluation of a User-Centric Information Security Risk Assessment and Response Framework

The risk of sensitive information disclosure and modification through the use of online services has increased considerably and may result in significant damage. As the management and assessment of such risks is a well-k...

Classification of Affective States via EEG and Deep Learning

Human emotions play a key role in numerous decision-making processes. The ability to correctly identify likes and dislikes as well as excitement and boredom would facilitate novel applications in neuromarketing, affectiv...

Download PDF file
  • EP ID EP128611
  • DOI 10.14569/IJACSA.2016.070846
  • Views 83
  • Downloads 0

How To Cite

Rania Chakroun, Leila Zouari, Mondher Frikha (2016). An Improved Approach for Text-Independent Speaker Recognition. International Journal of Advanced Computer Science & Applications, 7(8), 343-348. https://europub.co.uk/articles/-A-128611