Investigation of Pitch and Duration Range in Speech of Sindhi Adults for Prosody Generation Module

Abstract

Prosody refers to the structure of sound and rhythm, both of which are essential to speech processing applications. It comprises tone, stress, intonation and rhythm. Pitch and duration are the core acoustic elements, and information about them can ease the design and development of an application module. The prosody module can be validated through these two features. In this paper, both factors are investigated using the speech of Sindhi adults. For the experiment and analysis, 245 male and female undergraduate students from five different districts of upper Sindh were selected as speakers and categorized into groups according to their age. Each speaker was given particular sentences, which were recorded individually. These sentences were then segmented into words and stored in a database consisting of 1960 sounds. The spread of pitch frequency was measured using standard deviation (SD). The lowest mean SD values, 0.25 Hz and 0.28 Hz, were obtained from the male and female groups of district Sukkur. The highest mean SD values, 0.42 Hz and 0.49 Hz, were measured for the male and female groups of district Ghotki. In general, the pitch of female speakers was found to be higher than that of male speakers, with a variation of 0.072 Hz.
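The per-group measurement described in the abstract can be illustrated with a minimal sketch. It assumes each word-level recording has already been reduced to a list of pitch (F0) values in Hz, and that the "mean SD" of a group is the average of the per-recording standard deviations. The function names, the use of population SD, and the toy values are all assumptions for illustration; they are not taken from the paper.

```python
from statistics import mean, pstdev


def mean_pitch_sd(tracks):
    """Average the standard deviation of pitch over several recordings.

    tracks: list of pitch tracks, each a list of F0 values in Hz.
    Population SD is used here; whether the paper used population or
    sample SD is not stated, so this choice is an assumption.
    """
    return mean(pstdev(track) for track in tracks)


def group_mean_sd(samples):
    """Group (district, gender, pitch_track) samples and compute mean SD per group."""
    groups = {}
    for district, gender, track in samples:
        groups.setdefault((district, gender), []).append(track)
    return {key: mean_pitch_sd(tracks) for key, tracks in groups.items()}


# Toy values for illustration only (hypothetical, not data from the study).
toy_samples = [
    ("A", "male", [119.0, 121.0]),
    ("A", "male", [120.0, 120.0]),
    ("A", "female", [209.0, 211.0, 209.0, 211.0]),
]

result = group_mean_sd(toy_samples)
for (district, gender), sd in sorted(result.items()):
    print(f"district {district}, {gender}: mean SD = {sd:.2f} Hz")
```

Keeping the grouping key as a (district, gender) pair mirrors the way the abstract reports results, one value per gender per district; an age-group field could be added to the key in the same way.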

Authors and Affiliations

Shahid Ali Mahar, Mumtaz Hussain Mahar, Shahid Hussain Danwar, Javed Ahmed Mahar

DOI: 10.14569/IJACSA.2019.0100924

How To Cite

Shahid Ali Mahar, Mumtaz Hussain Mahar, Shahid Hussain Danwar, Javed Ahmed Mahar (2019). Investigation of Pitch and Duration Range in Speech of Sindhi Adults for Prosody Generation Module. International Journal of Advanced Computer Science & Applications, 10(9), 187-195. https://europub.co.uk/articles/-A-645818