Segmentation of Telugu Touching Conjunct Consonants Using Overlapping Bounding Boxes
Journal Title: International Journal on Computer Science and Engineering - Year 2013, Vol 5, Issue 6
Abstract
Telugu is an ancient language spoken by about 84.6 million people of Andhra Pradesh. Its script has a circular orthography with few horizontal and slanted strokes. A large body of literature exists for this language in printed form and needs to be preserved by scanning it and converting it into editable form. Segmentation of touching characters is a major issue in any OCR system: segmenting words into individual glyphs by Connected Component Analysis yields poor results when characters touch. Touching conjunct consonants are the main case that must be handled properly to improve the accuracy of an OCR system. This paper presents an overlapping bounding box approach for segmenting conjunct consonants, along with an algorithm for identifying the correct touching location. An accuracy rate of 91.27% is achieved.
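The abstract does not spell out the algorithm, so the following is only a minimal sketch of the two building blocks it names, not the authors' method. It assumes a binarized word image given as a list of rows of 0/1 values and 8-connectivity; the function names `connected_components` and `boxes_overlap` and the two toy grids are hypothetical. It illustrates why Connected Component Analysis alone merges touching glyphs into a single bounding box, and how an overlap test between two boxes might be written.

```python
from collections import deque


def connected_components(grid):
    """Return bounding boxes (top, left, bottom, right) of 8-connected
    foreground regions, in row-major order of their first pixel."""
    rows, cols = len(grid), len(grid[0])
    seen = [[False] * cols for _ in range(rows)]
    boxes = []
    for r in range(rows):
        for c in range(cols):
            if grid[r][c] != 1 or seen[r][c]:
                continue
            # Breadth-first flood fill from this unvisited foreground pixel.
            top, left, bottom, right = r, c, r, c
            queue = deque([(r, c)])
            seen[r][c] = True
            while queue:
                y, x = queue.popleft()
                top, bottom = min(top, y), max(bottom, y)
                left, right = min(left, x), max(right, x)
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = y + dy, x + dx
                        if (0 <= ny < rows and 0 <= nx < cols
                                and grid[ny][nx] == 1 and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
            boxes.append((top, left, bottom, right))
    return boxes


def boxes_overlap(a, b):
    """True if two inclusive (top, left, bottom, right) boxes share a pixel."""
    return a[0] <= b[2] and b[0] <= a[2] and a[1] <= b[3] and b[1] <= a[3]


# Toy example: a base glyph (upper left) and a conjunct below and to its right.
SEPARATE = [
    [1, 1, 0, 0, 0],
    [1, 1, 0, 0, 0],
    [0, 0, 0, 1, 1],
    [0, 0, 0, 1, 1],
]
# Same shapes, but one extra pixel makes them touch diagonally.
TOUCHING = [
    [1, 1, 0, 0, 0],
    [1, 1, 0, 0, 0],
    [0, 0, 1, 1, 1],
    [0, 0, 0, 1, 1],
]

print(connected_components(SEPARATE))  # two boxes, one per glyph
print(connected_components(TOUCHING))  # a single merged box
```

In the touching case the labeling step reports one component, so a later stage has to decide where to cut it; the paper's contribution is an algorithm for locating that touching point, which this sketch does not attempt.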
Authors and Affiliations
J. Bharathi, Dr. P. Chandrasekar Reddy
Enhancing the Communication Channel Through Secure Shell And Irrational DES
As the internet grows in popularity and therefore also in size more and more transmission takes place mainly because the technology is more readily available and applications have become more user friendly allowing entry...
A Single Format for Measuring different Aspects of Testing
In-process testing metrics have been used for some years and their usage is steadily increasing. There are different metrics for software testing, i.e., to measure testing progress, mean time between arrival of errors, densi...
A Voice Priority Queue (VPQ) Fair Scheduler for the VoIP over WLANs
Transmission of VoIP over packet switching networks is one of the rapidly emerging real-time Internet Protocol. The real-time application of the Voice over Internet Protocol (VoIP) is growing rapidly for it is more flexi...
Log Mining Based on Hadoop’s Map and Reduce Technique
In the world of cloud and grid computing Virtual Database Technology (VDB) is one of the effective solutions for integration of data from heterogeneous sources. Hadoop is a large-scale distributed batch processing infras...
EVALUATION OF CBIR APPROACHES FOR DIFFERENTLY SIZED IMAGES
CBIR is the application of computer vision techniques to the image retrieval problem, that is, the problem of searching for digital images in large databases. An experimental comparison of a number of different color des...