Telugu Bigram Splitting using Consonant-based and Phrase-based Splitting

Abstract

Splitting is a conventional process in most of Indian languages according to their grammar rules. It is called ‘pada vicchEdanam’ (a Sanskrit term for word splitting) and is widely used by most of the Indian languages. Splitting plays a key role in Machine Translation (MT) particularly when the source language (SL) is an Indian language. Though this splitting may not succeed completely in extracting the root words of which the compound is formed, but it shows considerable impact in Natural Language Processing (NLP) as an important phase. Though there are many types of splitting, this paper considers only consonant based and phrase based splitting.

Authors and Affiliations

T. Rao, Dr. T. V. Prasad

Keywords

Related Articles

A Parallel Genetic Algorithm for Maximum Flow Problem

The maximum flow problem is a type of network optimization problem in the flow graph theory. Many important applications used the maximum flow problem and thus it has been studied by many researchers using different meth...

Experimental Results on Agent-Based Indoor Localization using WiFi Signaling

This paper discusses experimental results on the possibility of accurately estimating the position of smart devices in known indoor environments using agent technology. Discussed localization approaches are based on WiFi...

fMRI Data Analysis Using Dempster-Shafer Method with Estimating Voxel Selectivity by Belief Measure

In the functional Magnetic Resonance Imaging (fMRI) data analysis, detecting the activated voxels is a challenging research problem where the existing methods have shown some limits. We propose a new method wherein brain...

Hypercube Graph Decomposition for Boolean Simplification: An Optimization of Business Process Verification

This paper deals with the optimization of busi-ness processes (BP) verification by simplifying their equivalent algebraic expressions. Actual approaches of business processes verification use formal methods such as autom...

A Comparative Usability Study on the Use of Auditory Icons to Support Virtual Lecturers in E-Learning Interfaces

Prior conducted research revealed that the auditory icons could contribute in supporting the virtual lecturers in presence of full body animation while delivering the learning content in e-learning interfaces. This paper...

Download PDF file
  • EP ID EP131641
  • DOI 10.14569/IJACSA.2014.050518
  • Views 88
  • Downloads 0

How To Cite

T. Rao, Dr. T. V. Prasad (2014). Telugu Bigram Splitting using Consonant-based and Phrase-based Splitting. International Journal of Advanced Computer Science & Applications, 5(5), 122-128. https://europub.co.uk/articles/-A-131641