Audio Search Based on Keyword Spotting in Arabic Language

Abstract

Keyword spotting is an important application of speech recognition. This research introduces a keyword spotting approach to perform audio searching of uttered words in Arabic speech. The matching process depends on the utterance nucleus which is insensitive to its context. For spotting the targeted utterances, the matched nuclei are expanded to cover the whole utterances. Applying this approach to Quran and standard Arabic has promising results. To improve this spotting approach, it is combined with a text search in case of the existence of a transcript. This can be applied on Quran as there is exact correspondence between the audio and text files of each verse. The developed approach starts by text search to identify the verses that include the target utterance(s). For each allocated verse, the occurrence(s) of the target utterance is determined. The targeted utterance (the reference) is manually segmented from an allocated verse. Then Keyword spotting is performed for the extracted reference to the corresponding audio file. The accuracy of the spotted utterances achieved 97%. The experiments showed that the use of the combined text and audio search has reduced the search time by 90% when compared with audio search only tested on the same content. The developed approach has been applied to non transcribed audio files (preaches and News) for searching chosen utterances. The results are promising. The accuracy of spotting was around 84% in case of preaches and 88% in case of the news.

Authors and Affiliations

Mostafa Awaid, Sahar Fawzi, Ahmed Kandil

Keywords

Related Articles

Arabic Sign Language (ArSL) Recognition System Using HMM

Hand gestures enabling deaf people to communication during their daily lives rather than by speaking. A sign language is a language which, instead of using sound, uses visually transmitted gesture signs which simultaneou...

Identification and Nonlinear PID Control of Hammerstein Model using Polynomial Structures

In this paper, a new nonlinear discrete-time PID is proposed to control Hammerstein model. This model is composed by a static nonlinearity gain associated to a linear dynamic sub-system. Nonlinear polynomial structures a...

Media Content Access: Image-based Filtering

As the content on the internet contains sensitive adult material, filtering and blocking this content is essential for the social and ethical values of the many societies and organizations. In this paper, the content fil...

Examining Software Intellectual Property Rights

Intellectual property rights (IPR) of computer software is the right to assign the software to its creator, not limited to time and space, and non-transferable. Proving IPR of the creators of computer software requires a...

Detection of Soft Atheroscelarotic Plaques in Cardiac Computed Tomography Angiography

Computed tomography angiography (CTA) has turned non-invasive diagnosis of cardiovascular anomalies into a reality as state-of-the-art imaging equipment is capable of recording sub-millimeter details. Based on high inten...

Download PDF file
  • EP ID EP141854
  • DOI 10.14569/IJACSA.2014.050219
  • Views 127
  • Downloads 0

How To Cite

Mostafa Awaid, Sahar Fawzi, Ahmed Kandil (2014). Audio Search Based on Keyword Spotting in Arabic Language. International Journal of Advanced Computer Science & Applications, 5(2), 128-133. https://europub.co.uk/articles/-A-141854