Segmentation of Telugu Touching Conjunct Consonants Using Overlapping Bounding Boxes

Journal Title: International Journal on Computer Science and Engineering - Year 2013, Vol 5, Issue 6

Abstract

Telugu is an ancient historic language. It is spoken by about 84.6 million people of Andhra Pradesh. The script has circular orthography with few horizontal and slant strokes. Huge literature exists for this language in printed form which needs to be preserved by scanning and converting it into editable form. Segmentation of touching characters is a major issue in any OCR system. Segmenting the words into individual glyphs by Connected Component Analysis yields poor results due to touching characters. Touching conjunct consonants is the major component which needs to be properly addressed for improving the accuracy of an OCR system. In this paper an overlapping bounding box approach is presented for segmenting the conjunct consonants along with an algorithm for identifying the correct touching location. An accuracy rate of 91.27% is achieved.

Authors and Affiliations

J. Bharathi , Dr. P. Chandrasekar Reddy

Keywords

Related Articles

Enhancing the Communication Channel Through Secure Shell And Irrational DES

As the internet grows in popularity and therefore also in size more and more transmission takes place mainly because the technology is more readily available and applications have become more user friendly allowing entry...

A Single Fromat for Measuring different Aspects of Testing

In-Process testing metrics has been used from some years and its usage is frequently increasing. There are different metrics for software testing i.e to measure testing progress, Mean time between arrival of error, densi...

A Voice Priority Queue (VPQ) Fair Scheduler for the VoIP over WLANs

Transmission of VoIP over packet switching networks is one of the rapidly emerging real-time Internet Protocol. The real-time application of the Voice over Internet Protocol (VoIP) is growing rapidly for it is more flexi...

Log Mining Based on Hadoop’s Map and Reduce Technique

In the world of cloud and grid computing Virtual Database Technology (VDB) is one of the effective solutions for integration of data from heterogeneous sources. Hadoop is a large-scale distributed batch processing infras...

EVALUATION OF CBIR APPROACHES FOR DIFFERENTLY SIZED IMAGES

CBIR is the application of computer vision techniques to the image retrieval problem, that is, the problem of searching for digital images in large databases. An experimental comparison of a number of different color des...

Download PDF file
  • EP ID EP156687
  • DOI -
  • Views 143
  • Downloads 0

How To Cite

J. Bharathi, Dr. P. Chandrasekar Reddy (2013). Segmentation of Telugu Touching Conjunct Consonants Using Overlapping Bounding Boxes. International Journal on Computer Science and Engineering, 5(6), 538-546. https://europub.co.uk/articles/-A-156687