Text Separation from Graphics by Analyzing Stroke Width Variety in Persian City Maps

Abstract

Text segmentation is a live research field with vast new areas to be explored. Separating text layer from graphics is a fundamental step to exploit text and graphics information. The language used in the map is a challenging issue in text layer separation problem. All current methods are proposed for non-Persian language maps. In Persian, text strings are composed of one or more subwords. Each subword is also composed of one to several letters connected together. Therefore, the components of the text strings in Persian are more diverse in terms of size and geometric form than in English. Thus, the overlapping of the Persian text and the lines usually produces a complex structure that the existing methods cannot handle with the necessary efficiency. For this purpose, the stroke width variety of the input map is calculated, and then the average line width of graphics is estimated by analyzing the content of stroke width. After finding the average width of graphical lines, we classify the complex structure into text and graphics in pixel level. We evaluate our method on some variety of full crossing text and graphics in Persian maps and show that some promising results in terms of precision and recall (above 80% and 90%, respectively) are obtained.

Authors and Affiliations

Ali Ghafari- Beranghar, Ehsanollah Kabir, Kaveh Kangarloo

Keywords

Related Articles

Dynamic Inertia Weight Particle Swarm Optimization for Solving Nonogram Puzzles

Particle swarm optimization (PSO) has shown to be a robust and efficient optimization algorithm therefore PSO has received increased attention in many research fields. This paper demonstrates the feasibility of applying...

Novel Carrier based PWM Techniques Reduce Common Mode Voltage for Six Phase Induction Motor Drives

This paper proposes a novel pulse width modulation (CBPWM) technique for reducing the common mode voltage for a six-phase induction motor (SPIM) drive. This proposed CBPWM technique relies on setting up offset functions...

Simulating Cooperative Systems Applications: a New Complete Architecture

For a decade, embedded driving assistance systems were mainly dedicated to the management of short time events (lane departure, collision avoidance, collision mitigation). Recently a great number of projects have been fo...

An Enhanced Weighted Associative Classification Algorithm without Preassigned Weight based on Ranking Hubs

Heart disease is the preeminent reasons for death worldwide and in excess of 17 million individuals were kicked the bucket from heart disease in the past years and the mortality rate will be increased in upcoming years r...

Security Issues in Cloud Computing and their Solutions: A Review

Cloud computing is an internet-based, emerging technology, tends to be prevailing in our environment especially computer science and information technology fields which require network computing on large scale. Cloud com...

Download PDF file
  • EP ID EP322108
  • DOI 10.14569/IJACSA.2018.090632
  • Views 103
  • Downloads 0

How To Cite

Ali Ghafari- Beranghar, Ehsanollah Kabir, Kaveh Kangarloo (2018). Text Separation from Graphics by Analyzing Stroke Width Variety in Persian City Maps. International Journal of Advanced Computer Science & Applications, 9(6), 222-229. https://europub.co.uk/articles/-A-322108