Modified Grapheme Encoding and Phonemic Rule to Improve PNNR-Based Indonesian G2P

Abstract

A grapheme-to-phoneme conversion (G2P) is very important in both speech recognition and synthesis. The existing Indonesian G2P based on pseudo nearest neighbour rule (PNNR) has two drawbacks: the grapheme encoding does not adapt all Indonesian phonemic rules and the PNNR should select a best phoneme from all possible conversions even though they can be filtered by some phonemic rules. In this paper, a modified partial orthogonal binary grapheme encoding and a phonemic-based rule are proposed to improve the performance of PNNR-based Indonesian G2P. Evaluating on 5-fold cross-validation, contain 40K words to develop the model and 10K words to evaluation each, shows that both proposed concepts reduce the relative phoneme error rate (PER) by 13.07%. A more detail analysis shows the most errors are from grapheme ?e? that can be dynamically converted into either /E/ or /??/ since four prefixes, ’ber’, ’me’, ’per’, and ’ter’, produce many ambiguous conversions with basic words and also from some similar compound words with both different pronunciations for the grapheme ?e?. A stemming procedure can be applied to reduce those errors.

Authors and Affiliations

Suyanto , Sri Hartati, Agus Harjoko

Keywords

Related Articles

 Hybrid Denoising Method for Removal of Mixed Noise in Medical Images

 Nowadays, Digital image acquisition and processing techniques plays a very important role in current day medical diagnosis. During the acquisition process, there could be distortions in the images, which will negat...

Performance Evaluation of Routing Protocol (RPL) for Internet of Things

Recently, Internet Engineering Task Force (IETF) standardized a powerful and flexible routing protocol for Low Power and Lossy Networks (RPL). RPL is a routing protocol for low power and lossy networks in the Internet of...

An Immunity-based Error Containment Algorithm for Database Intrusion Response Systems

The immune system has received a special attention as a potential source of inspiration for innovative approaches to solve database security issues and build artificial immune systems. Database security issues need to be...

Impact of Thyristor Controlled Series Capacitor on Voltage Profile of Transmission Lines using PSAT

In power system voltage stability is very important in order to maintain the voltage within the defined limits. The demand of electrical power increases in the last decade due to the lack of expansion in the generation a...

Modeling of Quadrotor Roll Loop using Frequency Identification Method

Model estimation is an important step in quadrotor control design because model uncertainties can cause unstable behavior especially with non-robust control methods. In this paper, a modeling approach of a quadrotor prot...

Download PDF file
  • EP ID EP143769
  • DOI 10.14569/IJACSA.2016.070358
  • Views 96
  • Downloads 1

How To Cite

Suyanto, Sri Hartati, Agus Harjoko (2016). Modified Grapheme Encoding and Phonemic Rule to Improve PNNR-Based Indonesian G2P. International Journal of Advanced Computer Science & Applications, 7(3), 430-435. https://europub.co.uk/articles/-A-143769