Document Image Binarization Using Independent Component Analysis For OCR

Abstract

 The Image binarization plays a vital role in text segmentation which is used in OCR application. Binarization of text in degraded images is a challenging task due to the variations in size, color and font of the text and the results be often affected by complex backgrounds, dissimilar lighting conditions, reflections and shadow. A robust solution to this problem can significantly enhance the precision of scene text recognition algorithms leading to a variety of applications such as scene understanding, navigation, automatic localization and image retrieval. In this paper, we propose a novel method to extract and binarize text as of images that contains complex background. We apply an Independent Component Analysis (ICA) based technique to map out the text region, which is uniform in nature, while removing specularity, shadows and reflections, which are included in the background. This algorithm works better on images with different degradations. We implement our method on various DIBCO datasets.

Authors and Affiliations

Varada Sreeja

Keywords

Related Articles

 Corrosion Behavior of Amalgam/Cockles Shells Composites in Artificial Saliva

 Metal matrix composites have been prepared by adding cockles’ shells CS powder to dental amalgam with three weight percents 14.2, 28.5 and 42.8 wt% as an attempt to improve the corrosion behavior of the final prod...

 Optimize Parity Encoding for Power Reduction in Content Addressable Memory

 Most memory devices store and retrieve data by addressing specific memory locations. As a result, this path often becomes the limiting factor for systems that rely on fast memory accesses. The time required to fin...

 Free Vibration Analysis of Orthotropic Thin Rectangular SSSS Plate Using Polynomial Series Function

 Free vibration analysis of material orthotropic rectangular thin plate simply supported on all edges (SSSS) plate using Taylor’s series function in Ritz method was carried out. A Taylor’s series truncated at the 5...

 Routing Strategies in Adhoc Network

 Wireless Communication is the transfer of information over long distances without the use of wires. The connectivity of wireless communication is nearly everywhere and becoming highly affordable even for people wh...

Contra v-Closed Mappings

The aim of this paper is to introduce and study the concept of Contra v

Download PDF file
  • EP ID EP158692
  • DOI -
  • Views 80
  • Downloads 0

How To Cite

Varada Sreeja (30).  Document Image Binarization Using Independent Component Analysis For OCR. International Journal of Engineering Sciences & Research Technology, 3(9), 161-166. https://europub.co.uk/articles/-A-158692