Speech Activity Detection and its Evaluation in Speaker Diarization System

Journal Title: INTERNATIONAL JOURNAL OF COMPUTERS & TECHNOLOGY - Year 2017, Vol 16, Issue 1

Abstract

In speaker diarization, the speech/voice activity detection is performed to separate speech, non-speech and silent frames. Zero crossing rate and root mean square value of frames of audio clips has been used to select training data for silent, speech and nonspeech models. The trained models are used by two classifiers, Gaussian mixture model (GMM) and Artificial neural network (ANN), to classify the speech and non-speech frames of audio clip. The results of ANN and GMM classifier are compared by Receiver operating characteristics (ROC) curve and Detection ErrorTradeoff (DET) graph. It is concluded that neural network based SADcomparatively better than Gaussian mixture model based SAD.

Authors and Affiliations

Sukhvinder Kaur, J. S. Sohal

Keywords

Related Articles

A Comparison of Filtering Techniques for Image Quality Improvement in Computed Tomography

Computed Tomography (CT) is an important and most common modality in medical imaging. In CT examinations there is trade off between radiation dose and image quality. If radiation dose is decreased, the noise will unavoid...

EFFICIENT MANET- INTERNET INTEGRATION FOR MOBILE DEVICES

A mobile ad hoc network (MANET) consists of wireless mobile nodes without having a fixed infrastructure support. The communication between these mobile nodes is carried out without any centralized control. The communicat...

Providing security for Web Service Composition using Finite State Machine

The revolution impacted by Web Service as a solution to business and enterprise application integration throws light on the significance of security provided by Web Services during Web Service Composition. Satisfying the...

A Novel Way to Detect Hard Exudates Using Dynamic Thresholding Technique in Digital Retinal Fundus Image

Diabetic retinopathy is considered to be one of the major causes of blindness among diabetes mellitus patients. Due to diabetic retinopathy blood vessels of retina gets damaged and fat, lipoprotein substances gets leaked...

A tool for prototyping a precision GPS system

The GPS allows locating an object in any part of the World with a certain degree of accuracy. Some precision activities need to operate with a sub-metric level of accuracy. This paper introduces a data analysis tool to p...

Download PDF file
  • EP ID EP650926
  • DOI 10.24297/ijct.v16i1.5893
  • Views 89
  • Downloads 0

How To Cite

Sukhvinder Kaur, J. S. Sohal (2017). Speech Activity Detection and its Evaluation in Speaker Diarization System. INTERNATIONAL JOURNAL OF COMPUTERS & TECHNOLOGY, 16(1), 7567-7572. https://europub.co.uk/articles/-A-650926