Learning Assistant in Educational Field Using Automatic Speech Recognition
Journal Title: International Journal of Innovative Research in Computer Science and Technology - Year 2013, Vol 1, Issue 2
Abstract
Automatic Speech recognition is the translation of spoken words into text. It takes speech data as input and divides it into small time domain frames. Speech signal processing considering speech signals stationary for a small time interval. From point of view speech signals are divided into small units Morphims or Phonims. Any speech data can be sorted as word uttered followed by voice and silence intervals. Voice activity detection can be are employed to detect voiced and unvoiced part of speech. Speech processing consists of speech recognition, speech synthesis, speaker recognition, understanding of speech with reference to context, speech coding, speech enhancement, speech transmission, speech to text conversion & text to speech conversion etc. In general speech to text conversion system will convert input speech data to output text data. If the input speech data is inappropriate with some errors then there is a possibility to get incorrect output data. The proposed system contains options for correction of inappropriate input data so that the output text and speech data produce and pronounce is correct. The proposed system will be employed as learning assistance in educational field for students to learn correct pronunciation of words. The proposed system will also help tourists for conversation in local language.
Authors and Affiliations
PrajaktaKotwal, Prof. M. R. Dixit
Performance Comparison of Efficient Identity Based Signature Schemes
In a conventional public key crypto system, the participants must verify the certificate prior to use the public key. The main drawback of the certificate in conventional public key system are large storage, large comput...
A New Security Primitive Based on Hard AI Problems and Efficient User Authentication using Captcha and Graphical Passwords
In this paper,I present a new security primitive based on hard AI (Artificial Intelligence) problems, namely, a novel family of graphical password systems built on top of Captcha technology, which we call Captcha as grap...
Pitch Estimation and Analysis of speech signal
Speech is the principal form of human communication since it began from day one when human beings start to communicate. The rate of vibration produce by the vocal cords is called a fundamental frequency (F0) or pitch per...
Requirements of mHealth-Based Medication Management Systems
Medication error is one of the healthcare challenges which effects 10 percent of individuals around the world and medication management is a complicated process including multiple activities. One of these activities is m...
Portability in the Enterprise Applications
Fast development of applications and its growing reputation in recent years has motivated various IT organizations want to move application between one platforms to another, so portability is a rising concern. Portabilit...