Telugu Bigram Splitting using Consonant-based and Phrase-based Splitting

Abstract

Splitting is a conventional process in most of Indian languages according to their grammar rules. It is called ‘pada vicchEdanam’ (a Sanskrit term for word splitting) and is widely used by most of the Indian languages. Splitting plays a key role in Machine Translation (MT) particularly when the source language (SL) is an Indian language. Though this splitting may not succeed completely in extracting the root words of which the compound is formed, but it shows considerable impact in Natural Language Processing (NLP) as an important phase. Though there are many types of splitting, this paper considers only consonant based and phrase based splitting.

Authors and Affiliations

T. Rao, Dr. T. V. Prasad

Keywords

Related Articles

Industrial Financial Forecasting using Long Short-Term Memory Recurrent Neural Networks

This research deals with the industrial financial forecasting in order to calculate the yearly expenditure of the organization. Forecasting helps in estimation of the future trends and provides a valuable information to...

Non-Linear Energy Harvesting Dual-hop DF Relaying System over n-µ Fading Channels

In this work, we analyze a wireless energy harvest-ing decode-and-forward (DF) relaying network with beamforming that is based on a practical non-linear energy harvesting model over η-μ fading channels. We consider a dua...

Sentiment Analysis Challenges of Informal Arabic Language

Recently, there are wide numbers of users that use the social network like Twitter, Facebook, MySpace to share various kinds of resources, express their opinions, thoughts, messages in real time. Thus, increase the amoun...

A Survey of Energy Aware Cloud’s Resource Allocation Techniques for Virtual Machine Consolidation

As the demand for cloud computing environment is increasing, new techniques for making cloud computing more environment-friendly are being proposed with an aim to convert traditional cloud computing into green cloud comp...

Feature Subsumption for Sentiment Classification of Dynamic Data in Social Networks using SCDDF

The analysis of opinions till now is done mostly on static data rather than on the dynamic data. Opinions may vary in time. Earlier methods concentrated on opinions expressed in an individual site. But on a given concept...

Download PDF file
  • EP ID EP131641
  • DOI 10.14569/IJACSA.2014.050518
  • Views 116
  • Downloads 0

How To Cite

T. Rao, Dr. T. V. Prasad (2014). Telugu Bigram Splitting using Consonant-based and Phrase-based Splitting. International Journal of Advanced Computer Science & Applications, 5(5), 122-128. https://europub.co.uk/articles/-A-131641