Urdu Optical Character Recognition Technique for Jameel Noori Nastaleeq Script

Journal Title: Journal of Independent Studies and Research - Computing - Year 2015, Vol 13, Issue 1

Abstract

Urdu OCR systems have attracted considerable interest from developers in recent years. Research on Urdu OCR is active, but the complexity of Urdu fonts has so far kept these systems from reaching the accuracy needed for widespread use. The main objective of this work was to create a technique that could be applied to any existing Urdu font or script. In this paper, the authors develop a technique that extracts text set in the Urdu font “Jameel Noori Nastaleeq” from images and converts it into editable Unicode text. The approach comprises pre-processing, connected-component labeling, feature extraction, and image comparison. The identified objects are saved as templates, which are compared against a white pixel position length database created by the authors; each matched template is then converted into its Unicode equivalent.
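
The pipeline named in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not define the "white pixel position length" feature, so the sketch assumes it means the row, start column, and length of each run of white (text) pixels inside a component. The function names, the threshold, the exact-match lookup, and the two-entry toy database are all hypothetical.

```python
from collections import deque

def binarize(gray, threshold=128):
    """Pre-processing (assumed): dark ink becomes 1 ("white" after inversion)."""
    return [[1 if px < threshold else 0 for px in row] for row in gray]

def label_components(binary):
    """Label 8-connected components; returns one pixel list per component."""
    h, w = len(binary), len(binary[0])
    seen = [[False] * w for _ in range(h)]
    components = []
    for r in range(h):
        for c in range(w):
            if binary[r][c] and not seen[r][c]:
                seen[r][c] = True
                queue, pixels = deque([(r, c)]), []
                while queue:  # breadth-first flood fill
                    y, x = queue.popleft()
                    pixels.append((y, x))
                    for dy in (-1, 0, 1):
                        for dx in (-1, 0, 1):
                            ny, nx = y + dy, x + dx
                            if (0 <= ny < h and 0 <= nx < w
                                    and binary[ny][nx] and not seen[ny][nx]):
                                seen[ny][nx] = True
                                queue.append((ny, nx))
                components.append(pixels)
    return components

def run_features(pixels):
    """Assumed feature: (row, start, length) of each white run, relative
    to the component's bounding box."""
    top = min(y for y, _ in pixels)
    left = min(x for _, x in pixels)
    rows = {}
    for y, x in pixels:
        rows.setdefault(y - top, []).append(x - left)
    features = []
    for y in sorted(rows):
        xs = sorted(rows[y])
        start = prev = xs[0]
        for x in xs[1:]:
            if x != prev + 1:  # gap found, so the current run ends
                features.append((y, start, prev - start + 1))
                start = x
            prev = x
        features.append((y, start, prev - start + 1))
    return tuple(features)

def recognize(gray, database):
    """Compare each component's features to the database (exact match here)."""
    binary = binarize(gray)
    return "".join(database.get(run_features(p), "?")
                   for p in label_components(binary))

# Toy 3x5 grayscale image: a vertical bar and an isolated dot.
gray = [
    [255, 0, 255, 255, 255],
    [255, 0, 255, 255, 255],
    [255, 0, 255, 0, 255],
]

# Hypothetical database mapping run features to Unicode characters.
database = {
    ((0, 0, 1), (1, 0, 1), (2, 0, 1)): "\u0627",  # bar, mapped to alef
    ((0, 0, 1),): "\u06D4",                       # dot, mapped to Urdu full stop
}

result = recognize(gray, database)
```

Components are returned in top-to-bottom, left-to-right scan order, so a real system would still need to reorder them for right-to-left Urdu text and would likely use a tolerance-based comparison rather than exact matching.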

  • EP ID EP643245
  • DOI 10.31645/jisrc/(2015).13.1.0011

How To Cite

(2015). Urdu Optical Character Recognition Technique for Jameel Noori Nastaleeq Script. Journal of Independent Studies and Research - Computing, 13(1), 81-86. https://europub.co.uk/articles/-A-643245