Comparative Analysis of Raw Images and Meta Feature based Urdu OCR using CNN and LSTM

Abstract

Urdu language uses cursive script which results in connected characters constituting ligatures. For identifying characters within ligatures of different scales (font sizes), Convolution Neural Network (CNN) and Long Short Term Memory (LSTM) Network are used. Both network models are trained on formerly extracted ligature thickness graphs, from which models extract Meta features. These thickness graphs provide consistent information across different font sizes. LSTM and CNN are also trained on raw images to compare performance on both forms of inputs. For this research, two corpora, i.e. Urdu Printed Text Images (UPTI) and Centre for Language Engineering (CLE) Text Images are used. Overall performance of networks ranges between 90% and 99.8%. Average accuracy on Meta features is 98.08% while using raw images, 97.07% average accuracy is achieved.

Authors and Affiliations

Asma Naseer, Kashif Zafar

Keywords

Related Articles

YAWARweb: Pilot Study about the usage of a Web Service to Raise Awareness of Blood Donation Campaigns on University Campuses in Lima, Peru

This document presents a preliminary study about a pilot deployment of a web service. The service is used as means to raise awareness in university campuses prior to blood donation campaigns and to measure its effect int...

Big Data Classification Using the SVM Classifiers with the Modified Particle Swarm Optimization and the SVM Ensembles

The problem with development of the support vector machine (SVM) classifiers using modified particle swarm optimization (PSO) algorithm and their ensembles has been considered. Solving this problem would allow fulfilling...

Automatic Detection Technique for Speech Recognition based on Neural Networks Inter-Disciplinary

Automatic speech recognition allows the machine to understand and process information provided orally by a human user. It consists of using matching techniques to compare a sound wave to a set of samples, usually compose...

Survey on Human Activity Recognition based on Acceleration Data

Human activity recognition is an important area of machine learning research as it has many utilization in different areas such as sports training, security, entertainment, ambient-assisted living, and health monitoring...

Security Issues in Cloud Computing and their Solutions: A Review

Cloud computing is an internet-based, emerging technology, tends to be prevailing in our environment especially computer science and information technology fields which require network computing on large scale. Cloud com...

Download PDF file
  • EP ID EP261663
  • DOI 10.14569/IJACSA.2018.090157
  • Views 74
  • Downloads 0

How To Cite

Asma Naseer, Kashif Zafar (2018). Comparative Analysis of Raw Images and Meta Feature based Urdu OCR using CNN and LSTM. International Journal of Advanced Computer Science & Applications, 9(1), 419-424. https://europub.co.uk/articles/-A-261663