Greedy Algorithms to Optimize a Sentence Set Near-Uniformly Distributed on Syllable Units and Punctuation Marks

Abstract

An optimum sentence set that near-uniformly dis-tributed on syllable units and punctuation marks is important to develop a syllable-based automatic speech recognition (ASR). It is usually extracted from a mother set of millions of unique sentences using Modified Least-to-Most (LTM) Greedy algorithm. The Modified LTM Greedy is capable of minimizing the number of syllables but ignores distributing their frequencies. Hence, two schemes are proposed to minimize the number of syllables as well as to distribute their frequencies near-uniformly. Testing on a mother set of 10 million Indonesian sentences shows that both schemes perform better than the Modified LTM Greedy for two syllable units: monosyllables and bisyllables.

Authors and Affiliations

Bagus Nugroho Budi Nurtomo, Suyanto Suyanto

Keywords

Related Articles

An Adaptive Intrusion Detection Method for Wireless Sensor Networks

Current intrusion detection systems for Wireless Sensor Networks (WSNs) which are usually designed to detect a specific form of intrusion or only applied for one specific type of network structure has apparently restrict...

A Universally Designed and Usable Data Visualization for A Mobile Application in the Context of Rheumatoid Arthritis

This paper discusses the design, development and evaluation of a data visualization prototype for a mobile application, for people with rheumatoid arthritis conditions. The visualizations concern ways of displaying graph...

Using Digital Image Processing to Make an Intelligent Gate

This paper presents an automatic system for controlling and dominating building gate based on digital image processing. The system begins with a digital camera, which captures a picture for that vehicle which intends to...

Real-Time Simulation and Analysis of the Induction Machine Performances Operating at Flux Constant

In this paper, we are interested, in a first time, at the study and the implementation of a V/f control for induction machine in real time. After, We are attached to a comparison of the results by simulation and experime...

Recovering and Tracing Links between Software Codes and Test Codes of the Open Source Projects

One of the most important controversial issues in the design and implementation of software is the functionality of the designed system. With impressive efforts of different software teams in the field of the system, the...

Download PDF file
  • EP ID EP408080
  • DOI 10.14569/IJACSA.2018.091035
  • Views 83
  • Downloads 0

How To Cite

Bagus Nugroho Budi Nurtomo, Suyanto Suyanto (2018). Greedy Algorithms to Optimize a Sentence Set Near-Uniformly Distributed on Syllable Units and Punctuation Marks. International Journal of Advanced Computer Science & Applications, 9(10), 291-296. https://europub.co.uk/articles/-A-408080