SPEAKER IDENTIFICATION USING 2-D DCT, WALSH AND HAAR ON FULL AND BLOCK SPECTROGRAM

Apply

SPEAKER IDENTIFICATION USING 2-D DCT, WALSH AND HAAR ON FULL AND BLOCK SPECTROGRAM

Journal Title: International Journal on Computer Science and Engineering - Year 2010, Vol 2, Issue 5

Abstract

This paper aims to provide different approaches to text dependent speaker identification using DCT, Walsh and Haar transform along with use of spectrograms. Spectrograms obtained from speech samples are used as image database for the study undertaken. This image database is then bjected to various transforms. Using Euclidean distance as measure of similarity, most appropriate speaker match is btained and is declared as identified speaker. Each transform is applied to spectrograms in two different ways: on full image and on mage blocks. In both the ways, effect of different umber of coefficients of transformed image is observed. Haar transform on full image reduces multiplications required by DCT and Walsh by 28 times whereas applying Haar transform on image blocks requires 18 times less mathematical omputations as compared to DCT and Walsh on image blocks. Transforms when applied to image blocks, yield better or equal identification rates with educed computational complexity.

Authors and Affiliations

Dr. H. B. Kekre , Dr. Tanuja K. Sarode , Shachi J. Natu , Prachi J. Natu

Keywords

Speaker identification; Speaker Recognition; Spectrograms; DCT; WALSH; HAAR; Image Blocks

E–Learning Using Mapreduce

E-Learning is the learning process created by interaction with digitally delivered content, services and support. Learner’s profile plays a crucial role in the evaluation process and to improve the elearning process. The...

Multi-party Quantum Communication in biological Cells

Enhancing Security Of Agent-Oriented Techniques Programs Code Using Jar Files

Agent-oriented techniques characterize an exciting new way of analyzing, designing and building complex software systems in real time world. These techniques have the prospective to significantly improve current practice...

Multi Subgroup Data Compression Technique Using Switch Code

Data compression is the art of converting a data stream into a small in size data bits so that it can be easily travel a long istance without increasing load of its volume on a constant Bandwidth channel regardless of i...

Analysis and Comparative Study of Clock Synchronization Schemes in Wireless Sensor Networks

Time synchronization is an important issue in wireless sensor networks. Many applications based on these WSNs assume local clocks at each sensor node that need to be synchronized to a common notion of time. Some intrinsi...

EP ID EP91894
DOI -
Views 144
Downloads 0

How To Cite

Dr. H. B. Kekre, Dr. Tanuja K. Sarode, Shachi J. Natu, Prachi J. Natu (2010). SPEAKER IDENTIFICATION USING 2-D DCT, WALSH AND HAAR ON FULL AND BLOCK SPECTROGRAM. International Journal on Computer Science and Engineering, 2(5), 1733-1740. https://europub.co.uk/articles/-A-91894