Audio Search Based on Keyword Spotting in Arabic Language

Abstract

Keyword spotting is an important application of speech recognition. This research introduces a keyword spotting approach to perform audio searching of uttered words in Arabic speech. The matching process depends on the utterance nucleus which is insensitive to its context. For spotting the targeted utterances, the matched nuclei are expanded to cover the whole utterances. Applying this approach to Quran and standard Arabic has promising results. To improve this spotting approach, it is combined with a text search in case of the existence of a transcript. This can be applied on Quran as there is exact correspondence between the audio and text files of each verse. The developed approach starts by text search to identify the verses that include the target utterance(s). For each allocated verse, the occurrence(s) of the target utterance is determined. The targeted utterance (the reference) is manually segmented from an allocated verse. Then Keyword spotting is performed for the extracted reference to the corresponding audio file. The accuracy of the spotted utterances achieved 97%. The experiments showed that the use of the combined text and audio search has reduced the search time by 90% when compared with audio search only tested on the same content. The developed approach has been applied to non transcribed audio files (preaches and News) for searching chosen utterances. The results are promising. The accuracy of spotting was around 84% in case of preaches and 88% in case of the news.

Authors and Affiliations

Mostafa Awaid, Sahar Fawzi, Ahmed Kandil

Keywords

Related Articles

 A Comparative Study on Temporal Mobile Access Pattern Mining Methods

 Mobile users behavior patterns is one of the most critical issues that need to be explored in mobile agent systems. Recently the algorithms of discovering frequent mobile user’s behavior patterns have been studied...

A Sleep Monitoring System with Sleep-Promoting Functions in Noise Detection and Sound Generation

Recently, there has been a growing demand and interest in developing sleep-promoting systems for improving sleep condition. Because sleep environments are various, and sensitivity to noise differs individually, it is dif...

A Zone Classification Approach for Arabic Documents using Hybrid Features

Zone segmentation and classification is an important step in document layout analysis. It decomposes a given scanned document into zones. Zones need to be classified into text and non-text, so that only text zones are pr...

A New Mixed Signal Platform to Study the Accuracy/Complexity Trade-Off of DPD Algorithms

The increase in bandwidth of Power Amplifier (PA) input signals has led to the development of more complex behavioral PA models. Most recent models such as the Generalized Memory Polynomial (1) or the Polyharmonic distor...

Hyperspectral Image Segmentation using Homogeneous Area Limiting and Shortest Path Algorithm

Segmentation, as a preprocessing, plays an important role in hyperspectral images. In this paper, considering the similarity of neighboring pixels, using the size measure, the image spectrum is divided into several segme...

Download PDF file
  • EP ID EP141854
  • DOI 10.14569/IJACSA.2014.050219
  • Views 130
  • Downloads 0

How To Cite

Mostafa Awaid, Sahar Fawzi, Ahmed Kandil (2014). Audio Search Based on Keyword Spotting in Arabic Language. International Journal of Advanced Computer Science & Applications, 5(2), 128-133. https://europub.co.uk/articles/-A-141854