Glyph Identification and Character Recognition for Sindhi OCR

Abstract

A computer can read and write multiple languages and today?s computers are capable of understanding various human languages. A computer can be given instructions through various input methods but OCR (Optical Character Recognition) and handwritten character recognition are the input methods in which a scanned page containing text is converted into written or editable text. The change in language text available on scanned page demands different algorithm to recognize text because every language and script pose varying number of challenges to recognize text. The Latin language recognition pose less difficulties compared to Arabic script and languages that use Arabic script for writing and OCR systems for these Latin languages are near to perfection. Very little work has been done on regional languages of Pakistan. In this paper the Sindhi glyphs are identified and the number of characters and connected components are identified for this regional language of Pakistan. A graphical user interface has been created to perform identification task for glyphs and characters of Sindhi language. The glyphs of characters are successfully identified from scanned page and this information can be used to recognize characters. The language glyph identification can be used to apply suitable algorithm to identify language as well as to achieve a higher recognition rate.

Authors and Affiliations

N. A. Memon, F. Abassi, S. Zardari

Keywords

Related Articles

Energy and Exergy Analysis of a Coal Fired Power Plant

In this paper, energy and exergy analysis has been conducted on a subcritical coal fired power plant of Wisconsin Power and Light Company, USA to investigate the steam cycle energy and exergy efficiency. The cycle is ana...

Effect of Nano-Ceria on Physiognomies of Aluminum-5% Zinc Sacrificial Anode

Sacrificial anodes possessing higher electrochemical efficiency is the demand of marine, oil and gas industries. Due to high energy capability and long life light weight aluminum based anodes are more favorable as compar...

Study of Soil, Water, and Cropping Pattern in Danastar Wah (Manchar Lake) Command Area Using Geospatial Tools

The effluent water brought by RBOD (Right Bank Outfall Drain) is not only threat to the aquatic life of Manchar Lake but also the fertile agricultural lands which are being cultivated by use of lake water through Danasta...

Conjugated Conduction-Free Convection Heat Transfer in an Annulus Heated at Either Constant Wall Temperature or Constant Heat Flux

In this paper, we investigate numerically the effect of thermal boundary conditions on conjugated conduction-free convection heat transfer in an annulus between two concentric cylinders using Fourier Spectral method. The...

Tuning COCOMO-II for Software Process Improvement: A Tool Based Approach

In order to compete in the international software development market the software organizations have to adopt internationally accepted software practices i.e. standard like ISO (International Standard Organization) or CM...

Download PDF file
  • EP ID EP226281
  • DOI 10.22581/muet1982.1704.18
  • Views 116
  • Downloads 0

How To Cite

N. A. Memon, F. Abassi, S. Zardari (2017). Glyph Identification and Character Recognition for Sindhi OCR. Mehran University Research Journal of Engineering and Technology, 36(4), 933-940. https://europub.co.uk/articles/-A-226281