Text Separation from Graphics by Analyzing Stroke Width Variety in Persian City Maps

Abstract

Text segmentation is an active research field with many open problems. Separating the text layer from the graphics is a fundamental step in exploiting the text and graphics information in a map. The language used in the map is a challenging factor in text layer separation, and all current methods have been proposed for non-Persian maps. In Persian, text strings are composed of one or more subwords, and each subword consists of one or more connected letters. The components of Persian text strings are therefore more diverse in size and geometric form than those of English. As a result, the overlap of Persian text and lines usually produces a complex structure that existing methods cannot handle efficiently. To address this, we compute the stroke width variety of the input map and estimate the average line width of the graphics by analyzing the stroke width content. Using the estimated average width of the graphical lines, we classify the complex structure into text and graphics at the pixel level. We evaluate our method on a variety of fully crossing text and graphics in Persian maps and obtain promising results, with precision above 80% and recall above 90%.
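The abstract does not specify how the stroke width is measured or how the line width is estimated, so the following is only a toy sketch of the general idea, not the authors' algorithm. It assumes a binary image given as nested lists, approximates each pixel's stroke width by the shorter of its horizontal and vertical foreground runs, takes the most frequent width as the graphics line width, and labels pixels near that width as graphics. The function names, the `tol` parameter, and the use of the mode are all hypothetical choices made for illustration.

```python
from collections import Counter

def run_lengths(seq):
    """For each position of a 0/1 sequence, the length of the foreground run
    that contains it (0 for background positions)."""
    out = [0] * len(seq)
    i = 0
    while i < len(seq):
        if seq[i]:
            j = i
            while j < len(seq) and seq[j]:
                j += 1
            for k in range(i, j):
                out[k] = j - i
            i = j
        else:
            i += 1
    return out

def stroke_width_map(img):
    """Assumed stroke width proxy: min of horizontal and vertical run length."""
    horiz = [run_lengths(row) for row in img]
    cols = [run_lengths(col) for col in zip(*img)]
    vert = [list(row) for row in zip(*cols)]  # transpose back to row-major
    return [[min(h, v) for h, v in zip(hr, vr)] for hr, vr in zip(horiz, vert)]

def estimate_line_width(sw):
    """Assumption: graphics lines dominate, so their width is the mode."""
    counts = Counter(w for row in sw for w in row if w > 0)
    return counts.most_common(1)[0][0]

def classify(sw, line_width, tol=1):
    """Label each pixel: '.' background, 'G' graphics, 'T' text."""
    return [["." if w == 0 else ("G" if abs(w - line_width) <= tol else "T")
             for w in row] for row in sw]

# Toy input: a 2-pixel-thick horizontal line and a separate 4x4 blob.
demo = [[0] * 12, [1] * 12, [1] * 12, [0] * 12] + [[1] * 4 + [0] * 8 for _ in range(4)]
sw = stroke_width_map(demo)
width = estimate_line_width(sw)
labels = classify(sw, width)
```

On this toy input the line pixels share one width and the blob pixels another, so the two groups separate cleanly. Real maps with crossing text and lines would need a more robust stroke width measure and handling of the intersection regions, which this sketch does not attempt.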

Authors and Affiliations

Ali Ghafari-Beranghar, Ehsanollah Kabir, Kaveh Kangarloo

  • DOI 10.14569/IJACSA.2018.090632

How To Cite

Ali Ghafari-Beranghar, Ehsanollah Kabir, Kaveh Kangarloo (2018). Text Separation from Graphics by Analyzing Stroke Width Variety in Persian City Maps. International Journal of Advanced Computer Science & Applications, 9(6), 222-229. https://europub.co.uk/articles/-A-322108