Partial Greedy Algorithm to Extract a Minimum Phonetically-and-Prosodically Rich Sentence Set
Journal Title: International Journal of Advanced Computer Science & Applications - Year 2018, Vol 9, Issue 12
Abstract
A phonetically-and-prosodically rich sentence set is so important in collecting a read-speech corpus for developing phoneme-based speech recognition. The sentence set is usually searched from a huge text corpus of million sentences using the optimization methods. One of the commonly used optimization methods for this case is a Least-to-Most Greedy (LTMG) algo-rithm. It is effective in minimizing the number of phoneme-units. Unfortunately, it does not distribute their frequencies. In this paper, a new method called Partial LTMG algorithm (PLTMG) is proposed to search an optimum set containing triphones and prosodies those are distributed in a near-uniform fashion. Testing on an Indonesian text corpus of ten million sentences crawled from some websites of newspapers and novels shows that the proposed method is not only capable of minimizing both phoneme-units and prosodies but also effective in distributing their frequencies.
Authors and Affiliations
Fahmi Alfiansyah, Suyanto Suyanto
Validating Utility of TEIM: A Comparative Analysis
Concrete efforts to integrate Software Engineering and Human Computer Interaction exist in the form of models by many researchers. An unconventional model called TEIM (The Evolved Integrated Model) of Software Engineerin...
Intelligent System for Detection of Micro-Calcification in Breast Cancer
Recently; medical image mining has become one of the well-recognized research area(s) of machine learning and artificial intelligence techniques have been vastly used in various computer added diagnostic systems. Specifi...
Connectivity Resotration Techniques for Wireless Sensor and Actor Network (WSAN), A Review
Wireless Sensor and actor networks (WSANs) are the most promising research area in the field of wireless communication. It consists of large number of small independent sensor and powerful actor nodes equipped with commu...
Fault Attacks Resistant Architecture for KECCAK Hash Function
The KECCAK cryptographic algorithms widely used in embedded circuits to ensure a high level of security to any systems which require hashing as the integrity checking and random number generation. One of the most efficie...
Optimization and Evaluation of Hybrid PV/WT/BM System in Different Initial Costs and LPSP Conditions
A modelling and optimization study was performed to manage energy demand of a faculty in Karabuk University campus area working with a hybrid energy production system by using genetic algorithm (GA). Hybrid system consis...