Urdu Optical Character Recognition Technique for Jameel Noori Nastaleeq Script

Journal Title: Journal of Independent Studies and Research - Computing - Year 2015, Vol 13, Issue 1

Abstract

Urdu OCR's have been an object of interest for many developers in the recent years. Active research is being done pertaining to Urdu OCR’s, but because of the complexity associated with Urdu fonts; it still lacks perfection halting it from coming up to the surface. The main objective was to create a technique that could be applied to any of the existing Urdu fonts/scripts. In this paper, the authors have developed a technique which is capable of extracting the Urdu font “Jameel Noori Nastaleeq” from images and converts it into editable textual Unicodes. The approach comprises of pre-processing techniques, label connected components, feature extraction, and image comparison. The identified objects are saved as templates which are then compared to the white pixel position length database created by the authors in order to identify the templates which are then converted into Unicode.

Authors and Affiliations

Keywords

Related Articles

Online Optical Tomography System Application of Charge-Coupled Device (CCD) for Object Detection in Crystal Clear Water

This research presents an application of Charge-Coupled Device (CCD) linear sensor and laser diode in an optical tomography system. These optoelec- tronic sensors are believed to detect solid objects rather than transpar...

Ontology Driven Requirement Specification

Requirement engineering RE process is an important step of software development lifecycle and it includes a variety of activities starting with requirement elicitation to requirement documentation. This form of engineeri...

Analytical Comparison of RSA and RSA with Chinese Remainder Theorem

RSA encryption algorithm is one of the most powerful public key encryption algorithm. The problem with RSA algorithm is that RSA decryption is relatively slow in comparison to RSA encryption. Chinese Remainder Theorem (C...

An Investigation on Topic Maps Based Document Classification with Unbalance Classes

Classification of imbalanced data has become a widespread problem due to the fact that the most real world datasets are imbalanced. In a classification task, one of the challenges is to learn the feature-space of classif...

An Object Detection using Image Processing in Digital Forensics Science

Object detection is one of the most important sectors in digital forensics science. The object detection technique is valuable for a number of purposes for instance: medical diagnosis scanners, traffic monitoring system,...

Download PDF file
  • EP ID EP643245
  • DOI 10.31645/jisrc/(2015).13.1.0011
  • Views 153
  • Downloads 0

How To Cite

(2015). Urdu Optical Character Recognition Technique for Jameel Noori Nastaleeq Script. Journal of Independent Studies and Research - Computing, 13(1), 81-86. https://europub.co.uk/articles/-A-643245