A Flexible Architecture for Urdu Phonemes-Based Concatenative Speech Synthesis

Abstract

TTS (Text-to-Speech) synthesis systems are extensively used across the world to intensify the accessibility of information and to make it possible for the handicapped to be involved directly with computers to get the benefits from this high technology revolution. Various TTS synthesis techniques have been used with their own advantages and limitations. There is not a concatenative synthesis strategy based architecture for Urdu TTS synthesis system for handling the homographs and to avoid the unnatural robot sounding speech produced due the use of di-phones. In this paper, we propose a flexible architecture for Urdu TTS synthesis system that uses concatenative synthesis strategy because this approach has the ability to join together the small corpus of speech to generate natural and intelligible sound. The main aspiration of this research is to disambiguate the homographs in the Urdu language and to avoid the unnatural robot sounding speech. Finally, the effectiveness of the system is tested in terms of intelligibility and acceptability on word and sentence level. The intelligibility rate is near to 80% and 65% while acceptability rate for the naturalness is 95% (75% natural, 20% acceptable).

Authors and Affiliations

Muhammad Rizwan Ahmad, Muhammad Junaid Arshad

Keywords

Related Articles

Supporting Adaptation of Wireless Communication Protocols

Pervasive devices such as mobile phones and PDAs (Personal Digital Assistants) come with different wireless communication capabilities, for example, WiFi (Wireless Fidelity), Bluetooth, IrDA (Infrared), etc. In order for...

An Effective Channel Allocation Scheme to Reduce Co-Channel and Adjacent Channel Interference for WMN Backhaul

Two folded work presents channel allocation scheme sustaining channel orthogonality and channel spacing to reduce CCI (Co-Channel Interference) and ACI (Adjacent Channel Interference) for inter flow of an intra-flow link...

Linear Shrinkage Behaviour of Compacted Loam Masonry Blocks

Walls of wet loam, used in earthen houses, generally experience more shrinkage which results in cracks and less compressive strength. This paper presents a technique of producing loam masonry blocks that are compacted in...

Architecture of WiFi Based Broadcast Network for Rural Community

Digital divide is a reality in developing nations. Most of the technological advancements are available only in urban areas and rural community is still deprived of communication technology even in 21 st century. To ensu...

Effectiveness and Future Prospects of Telemedicine/Remote Health Care Management Applications in Pakistan

Medical/Health care system is spraining in Pakistan because of innovative technology, activities and services as per their financial cost (position) which is increasing day by day. This research is intended for the asses...

Download PDF file
  • EP ID EP184338
  • DOI -
  • Views 101
  • Downloads 0

How To Cite

Muhammad Rizwan Ahmad, Muhammad Junaid Arshad (2016). A Flexible Architecture for Urdu Phonemes-Based Concatenative Speech Synthesis. Mehran University Research Journal of Engineering and Technology, 35(3), 373-380. https://europub.co.uk/articles/-A-184338