Key Issues in Vowel Based Splitting of Telugu Bigrams

Abstract

Splitting compound Telugu words into their component or root words is an important, tedious, and still error-prone task in Natural Language Processing (NLP). Except in a few special cases, at least one vowel is involved in every Telugu conjunction. In the resulting compound, vowels are often repeated unchanged or converted into other vowels or consonants. This paper describes the issues involved in vowel-based splitting of a Telugu bigram into its proper root words using the conjunction (‘sandhi’) rules of Telugu grammar for machine translation (MT).

Authors and Affiliations

T. Rao, Dr. T. Prasad

  • EP ID EP110290
  • DOI 10.14569/SpecialIssue.2014.040102

How To Cite

T. Rao, Dr. T. Prasad (2014). Key Issues in Vowel Based Splitting of Telugu Bigrams. International Journal of Advanced Computer Science & Applications, 4(1), 9-16. https://europub.co.uk/articles/-A-110290