INDONESIAN TEXT-TO-SPEECH SYSTEM USING DIPHONE CONCATENATIVE SYNTHESIS

Journal Title: International Journal of Software Engineering and Computer Systems - Year 2015, Vol 1, Issue 1

Abstract

This paper describes the design and development of an Indonesian diphone database for speech synthesis. Segments of recorded speech are used to convert text to speech, and the result is saved as an audio file such as WAV or MP3. The work proceeds in two stages. First, the diphone database is built: a list of sample words containing the required diphones is compiled, giving priority to diphones located in the middle of a word and falling back to the beginning or end otherwise; the sample words are recorded and segmented; and the diphones are created with the Diphone Studio 1.3 tool. Second, the system is developed in Borland Delphi 6.0, including the conversion of input numbers, acronyms, words, and sentences into diphone representations. The Indonesian text-to-speech system involves two conversion processes: converting the input text to phonemes, and converting the phonemes to speech. The method used is diphone concatenative synthesis, in which recorded sound segments are collected and each segment consists of one diphone (two phonemes). This kind of synthesizer can produce speech with a high level of naturalness. The system can differentiate special phonemes, as in ‘Beda’ and ‘Bedak’, but samples of other specific words still need to be added to the system. It can also handle text containing abbreviations, and provides a facility for adding such words.
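
As a rough illustration of the text-to-diphone step described in the abstract, the following Python sketch splits a word into phonemes and then into overlapping diphone pairs. It is not the authors' Delphi implementation. The digraph list, the `_` silence marker, the `a-b` naming of diphone units, and the function names `to_phonemes` and `to_diphones` are all assumptions made for this example, and the sketch treats every other letter as one phoneme, so it does not capture the vowel distinction between ‘Beda’ and ‘Bedak’ mentioned above.

```python
# Hypothetical sketch: letter-based phoneme split plus diphone pairing.
# The digraph set and the "_" silence marker are assumptions, not taken
# from the paper.
DIGRAPHS = ("ng", "ny", "kh", "sy")


def to_phonemes(word):
    """Split a word into phonemes, treating known digraphs as one unit."""
    word = word.lower()
    phonemes = []
    i = 0
    while i < len(word):
        pair = word[i:i + 2]
        if pair in DIGRAPHS:
            phonemes.append(pair)
            i += 2
        elif word[i].isalpha():
            phonemes.append(word[i])
            i += 1
        else:
            # Skip anything that is not a letter (digits, punctuation).
            i += 1
    return phonemes


def to_diphones(phonemes, silence="_"):
    """Pair each phoneme with its successor, padding with silence."""
    padded = [silence] + list(phonemes) + [silence]
    return [a + "-" + b for a, b in zip(padded, padded[1:])]


if __name__ == "__main__":
    print(to_diphones(to_phonemes("bedak")))
    print(to_diphones(to_phonemes("bunga")))
```

In a concatenative synthesizer, each name in the resulting list would be used to look up one recorded segment in the diphone database, and the segments would then be joined in order to form the output audio.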

Authors and Affiliations

Sutarman Sutarman

Keywords

diphone; text to speech; concatenative synthesis

IMPROVED TWO-WAYS CLASSIFICATION FOR AGENT PATTERNS

Agent technology has been used in building various domain specific applications. The agent methodologies are proposed to aid the agent developer with the introduction of techniques, terminology, notation and guidelines d...

THE NEED OF DASHBOARD IN SOCIAL RESEARCH NETWORK SITES FOR RESEARCHERS

Nowadays, dashboards are widely used by organizations to display information based on their objectives, such as monitoring business performance or checking the current trend in a niche market. There is a need to inv...

A PROPOSED FRAMEWORK TO CONTROL RUMOUR PROPAGATION ON TWITTER FOR CRITICAL NATIONAL INFORMATION INFRASTRUCTURE (CNII) ORGANISATIONS

Critical National Information Infrastructure (CNII) organisations in Malaysia consist of many crucial sectors that affect not only national e-sovereignty but also economic, social and political matters. Due to the...

FAULT TOLERANCE FOR TWO WHEEL MOBILE ROBOT USING FSM (FINITE STATE MACHINE)

Fault Tolerance (FT) enables a system to continue operating in the event of failures. FT therefore serves as a backup component or procedure that can immediately take over to minimize any loss of service. FT exis...

REVERSIBLE WATERMARKING BASED ON SORTING PREDICTION ALGORITHM

Reversible watermarking has drawn a lot of interest in recent years. Sachnev et al. proposed a reversible watermarking algorithm that combines prediction, histogram shifting and sorting techniques, whic...

EP ID EP254084