LUCIDAH Ligative and Unligative Characters in a Dataset for Arabic Handwriting

Abstract

Arabic script is inherently cursive, even when machine-printed. When connected to other characters, some Arabic characters may be optionally written in compact aesthetic forms known as ligatures. It is useful to distinguish ligatures from ordinary characters for several applications, especially automatic text recognition. Datasets that do not annotate these ligatures may confuse the recognition system training. Some popular datasets manually annotate ligatures, but no dataset (prior to this work) took ligatures into consideration from the design phase. In this paper, a detailed study of Arabic ligatures and a design for a dataset that considers the representation of ligative and unligative characters are presented. Then, pilot data collection and recognition experiments are conducted on the presented dataset and on another popular dataset of handwritten Arabic words. These experiments show the benefit of annotating ligatures in datasets by reducing error-rates in character recognition tasks.

Authors and Affiliations

Yousef Elarian, Irfan Ahmad, Abdelmalek Zidouri, Wasfi G. Al-Khatib

Keywords

Related Articles

Mobile Web Services: State of the Art and Challenges

For many years mobile devices were commonly recognized as Web consumers. However, the advancements in mobile device manufacturing, coupled with the latest achievements in wireless communication developments are both key...

Analysis of Energy Saving Approaches in Cloud Computing using Ant Colony and First Fit Algorithms

Cloud computing is a style of technology that is increasingly used every day. It requires the use of an important amount of resources that is dynamically provided as a service. The growth of energy consumption associated...

English-Arabic Hybrid Machine Translation System using EBMT and Translation Memory

The availability of a machine translation to translate from English-to-Arabic with high accuracy is not available because of the difficult morphology of the Arabic Language. A hybrid machine translation system between Ex...

A Bayesian Approach to Predicting Water Supply and Rehabilitation of Water Distribution Networks

Water distribution network (WDN) consists of several elements the main ones: pipes and valves. The work developed in this article focuses on a water supply prediction in the short and long term. To this end, reliability...

 Contextual Modelling of Collaboration System

 Faced with new environmental constraints, firms decide to collaborate in collective entities and adopt new patterns of behavior. So, this firms’ collaboration becomes an unavoidable approach. Indeed, our aim intere...

Download PDF file
  • EP ID EP626788
  • DOI 10.14569/IJACSA.2019.0100855
  • Views 101
  • Downloads 0

How To Cite

Yousef Elarian, Irfan Ahmad, Abdelmalek Zidouri, Wasfi G. Al-Khatib (2019). LUCIDAH Ligative and Unligative Characters in a Dataset for Arabic Handwriting. International Journal of Advanced Computer Science & Applications, 10(8), 406-415. https://europub.co.uk/articles/-A-626788