SPEAKER IDENTIFICATION USING 2-D DCT, WALSH AND HAAR ON FULL AND BLOCK SPECTROGRAM

Journal Title: International Journal on Computer Science and Engineering - Year 2010, Vol 2, Issue 5

Abstract

This paper presents several approaches to text-dependent speaker identification using the DCT, Walsh and Haar transforms applied to spectrograms. Spectrograms obtained from speech samples serve as the image database for the study. This image database is then subjected to the various transforms. Using Euclidean distance as the measure of similarity, the closest speaker match is obtained and declared the identified speaker. Each transform is applied to the spectrograms in two different ways: on the full image and on image blocks. In both cases, the effect of using different numbers of coefficients of the transformed image is observed. The Haar transform on the full image requires 28 times fewer multiplications than DCT and Walsh, whereas applying the Haar transform on image blocks requires 18 times fewer mathematical computations than DCT and Walsh on image blocks. Transforms applied to image blocks yield better or equal identification rates with reduced computational complexity.
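The pipeline described in the abstract (transform the spectrogram image, keep a subset of coefficients, match by Euclidean distance) can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: it covers only the DCT case, the function names (`dct_matrix`, `dct2`, `features_full`, `features_blocks`, `identify`) are hypothetical, keeping the top-left k x k coefficients is an assumption about how the coefficient subset is chosen, and the 4 x 4 matrices are toy stand-ins for real spectrograms. The block variant assumes the image dimensions are multiples of the block size.

```python
import math

def dct_matrix(n):
    """Orthonormal DCT-II basis as an n x n list of rows."""
    rows = []
    for u in range(n):
        scale = math.sqrt((1.0 if u == 0 else 2.0) / n)
        rows.append([scale * math.cos((2 * x + 1) * u * math.pi / (2 * n))
                     for x in range(n)])
    return rows

def matmul(a, b):
    """Plain matrix product of two lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def transpose(m):
    return [list(col) for col in zip(*m)]

def dct2(image):
    """Separable 2-D DCT: C_rows * image * C_cols^T."""
    c_rows = dct_matrix(len(image))
    c_cols = dct_matrix(len(image[0]))
    return matmul(matmul(c_rows, image), transpose(c_cols))

def features_full(image, k):
    """Assumed feature choice: top-left k x k coefficients of the full image."""
    coeffs = dct2(image)
    return [coeffs[i][j] for i in range(k) for j in range(k)]

def features_blocks(image, block, k):
    """Top-left k x k coefficients of each non-overlapping block, concatenated.

    Assumes both image dimensions are multiples of `block` and k <= block.
    """
    feats = []
    for r in range(0, len(image), block):
        for c in range(0, len(image[0]), block):
            tile = [row[c:c + block] for row in image[r:r + block]]
            feats.extend(features_full(tile, k))
    return feats

def euclidean(a, b):
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def identify(test_features, enrolled):
    """Return the enrolled label whose feature vector is nearest to the probe."""
    return min(enrolled, key=lambda name: euclidean(test_features, enrolled[name]))

# Toy 4 x 4 "spectrograms" (illustrative values only).
spec_a = [[1, 2, 3, 4], [2, 3, 4, 5], [3, 4, 5, 6], [4, 5, 6, 7]]
spec_b = [[9, 1, 9, 1], [1, 9, 1, 9], [9, 1, 9, 1], [1, 9, 1, 9]]
enrolled = {
    "speaker_a": features_full(spec_a, 2),
    "speaker_b": features_full(spec_b, 2),
}
# Probe: spec_a with one perturbed pixel.
probe = [[1, 2, 3, 4], [2, 3, 4, 5], [3, 4, 5, 6], [4, 5, 6, 8]]
print(identify(features_full(probe, 2), enrolled))  # prints "speaker_a"
```

Swapping `dct_matrix` for a Walsh or Haar basis matrix would give the other two variants under the same matching scheme; the block version replaces `features_full` with `features_blocks` for both enrollment and probe.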

Authors and Affiliations

Dr. H. B. Kekre, Dr. Tanuja K. Sarode, Shachi J. Natu, Prachi J. Natu

How To Cite

Dr. H. B. Kekre, Dr. Tanuja K. Sarode, Shachi J. Natu, Prachi J. Natu (2010). SPEAKER IDENTIFICATION USING 2-D DCT, WALSH AND HAAR ON FULL AND BLOCK SPECTROGRAM. International Journal on Computer Science and Engineering, 2(5), 1733-1740. https://europub.co.uk/articles/-A-91894