Speaker Identification using Row Mean of Haar and Kekre’s Transform on Spectrograms of Different Frame Sizes 

Abstract

In this paper, we propose Speaker Identification using two transforms, namely Haar Transform and Kekre’s Transform. The speech signal spoken by a particular speaker is converted into a spectrogram by using 25% and 50% overlap between consecutive sample vectors. The two transforms are applied on the spectrogram. The row mean of the transformed matrix forms the feature vector, which is used in the training as well as matching phases. The results of both the transform techniques have been compared. Haar transform gives fairly good results with a maximum accuracy of 69% for both 25% as well as 50% overlap. Kekre’s Transform shows much better performance, with a maximum accuracy of 85.7% for 25% overlap and 88.5% accuracy for 50% overlap.

Authors and Affiliations

Dr. H B Kekre , Vaishali Kulkarni

Keywords

Related Articles

A Survey on using Neural Network based Algorithms for Hand Written Digit Recognition

The detection and recognition of handwritten content is the process of converting non-intelligent information such as images into machine edit-able text. This research domain has become an active research area due to vas...

Optimizing the Hyperparameter of Feature Extraction and Machine Learning Classification Algorithms

The process of assigning a quantitative value to a piece of text expressing a mood or effect is called Sentiment analysis. Comparison of several machine learning, feature extraction approaches, and parameter optimization...

KASP: A Cognitive-Affective Methodology for Designing Serious Learning Games

Many research studies agree on the existence of a close link between emotion and cognition. Actually, much research has demonstrated that students with learning disabilities (LD) experience emotional distress related to...

Design and Implementation on a Sub-band based Acoustic Echo Cancellation Approach

 This paper describes the design and implementation of a sub-band based acoustic echo cancellation approach, which incorporates the normalized least mean square algorithm and the double talk detection algorithm. Acc...

Achieving Regulatory Compliance for Data Protection in the Cloud

The advent of cloud computing has enabled organizations to take advantage of cost-effective, scalable and reliable computing platforms. However, entrusting data hosting to third parties has inherent risks. Where the data...

Download PDF file
  • EP ID EP139957
  • DOI -
  • Views 115
  • Downloads 0

How To Cite

Dr. H B Kekre, Vaishali Kulkarni (2011). Speaker Identification using Row Mean of Haar and Kekre’s Transform on Spectrograms of Different Frame Sizes . International Journal of Advanced Computer Science & Applications, 2(9), 6-12. https://europub.co.uk/articles/-A-139957