Comparative Analysis of Raw Images and Meta Feature based Urdu OCR using CNN and LSTM
Journal Title: International Journal of Advanced Computer Science & Applications - Year 2018, Vol 9, Issue 1
Abstract
Urdu language uses cursive script which results in connected characters constituting ligatures. For identifying characters within ligatures of different scales (font sizes), Convolution Neural Network (CNN) and Long Short Term Memory (LSTM) Network are used. Both network models are trained on formerly extracted ligature thickness graphs, from which models extract Meta features. These thickness graphs provide consistent information across different font sizes. LSTM and CNN are also trained on raw images to compare performance on both forms of inputs. For this research, two corpora, i.e. Urdu Printed Text Images (UPTI) and Centre for Language Engineering (CLE) Text Images are used. Overall performance of networks ranges between 90% and 99.8%. Average accuracy on Meta features is 98.08% while using raw images, 97.07% average accuracy is achieved.
Authors and Affiliations
Asma Naseer, Kashif Zafar
YAWARweb: Pilot Study about the usage of a Web Service to Raise Awareness of Blood Donation Campaigns on University Campuses in Lima, Peru
This document presents a preliminary study about a pilot deployment of a web service. The service is used as means to raise awareness in university campuses prior to blood donation campaigns and to measure its effect int...
Big Data Classification Using the SVM Classifiers with the Modified Particle Swarm Optimization and the SVM Ensembles
The problem with development of the support vector machine (SVM) classifiers using modified particle swarm optimization (PSO) algorithm and their ensembles has been considered. Solving this problem would allow fulfilling...
Automatic Detection Technique for Speech Recognition based on Neural Networks Inter-Disciplinary
Automatic speech recognition allows the machine to understand and process information provided orally by a human user. It consists of using matching techniques to compare a sound wave to a set of samples, usually compose...
Survey on Human Activity Recognition based on Acceleration Data
Human activity recognition is an important area of machine learning research as it has many utilization in different areas such as sports training, security, entertainment, ambient-assisted living, and health monitoring...
Security Issues in Cloud Computing and their Solutions: A Review
Cloud computing is an internet-based, emerging technology, tends to be prevailing in our environment especially computer science and information technology fields which require network computing on large scale. Cloud com...