Glyph Identification and Character Recognition for Sindhi OCR

Abstract

A computer can read and write multiple languages and today?s computers are capable of understanding various human languages. A computer can be given instructions through various input methods but OCR (Optical Character Recognition) and handwritten character recognition are the input methods in which a scanned page containing text is converted into written or editable text. The change in language text available on scanned page demands different algorithm to recognize text because every language and script pose varying number of challenges to recognize text. The Latin language recognition pose less difficulties compared to Arabic script and languages that use Arabic script for writing and OCR systems for these Latin languages are near to perfection. Very little work has been done on regional languages of Pakistan. In this paper the Sindhi glyphs are identified and the number of characters and connected components are identified for this regional language of Pakistan. A graphical user interface has been created to perform identification task for glyphs and characters of Sindhi language. The glyphs of characters are successfully identified from scanned page and this information can be used to recognize characters. The language glyph identification can be used to apply suitable algorithm to identify language as well as to achieve a higher recognition rate.

Authors and Affiliations

N. A. Memon, F. Abassi, S. Zardari

Keywords

Related Articles

A Flexible Architecture for Urdu Phonemes-Based Concatenative Speech Synthesis

TTS (Text-to-Speech) synthesis systems are extensively used across the world to intensify the accessibility of information and to make it possible for the handicapped to be involved directly with computers to get the ben...

Effects of Cu and Zn Coated Urea on Eh, pH and Solubility of Cu and Zn in Rice Soils

The concentration of Cu (Copper) and Zn (Zinc) decreases upon flooded conditions of rice soil. To assess the effects of flooding and application of Cu and Zn coated urea on changes in Eh, pH and solubility of Cu and Zn,...

Measuring the Role of Trust in M-Commerce Acceptance: An Empirical Analysis in Context of Pakistan

With the emergence of internet and WWW (World Wide Web), traditional businesses got a new opportunity to compete globally. A new term of M-Commerce (Mobile Commerce) emerged and set a new trend in commerce and business....

Evaluation of Daylight Intensity for Sustainbility in Residential Buildings in Cantonment Cottages Multan

Day lighting is a useful and effective source of energy savings and visual comforts in buildings. Occupants expect good daylight in their living spaces for better living environment. The quality and quantity of natural l...

Groundwater Quality Mapping using Geographic Information System: A Case Study of District Thatta, Sindh

Access to safe and affordable drinking water for all is an important goal of SDGs (Sustainable Development Goals). Degradation of water quality of coastal aquifers is a major concern throughout the world including the In...

Download PDF file
  • EP ID EP226281
  • DOI 10.22581/muet1982.1704.18
  • Views 121
  • Downloads 0

How To Cite

N. A. Memon, F. Abassi, S. Zardari (2017). Glyph Identification and Character Recognition for Sindhi OCR. Mehran University Research Journal of Engineering and Technology, 36(4), 933-940. https://europub.co.uk/articles/-A-226281