Document Image Binarization Using Independent Component Analysis For OCR

Abstract

 The Image binarization plays a vital role in text segmentation which is used in OCR application. Binarization of text in degraded images is a challenging task due to the variations in size, color and font of the text and the results be often affected by complex backgrounds, dissimilar lighting conditions, reflections and shadow. A robust solution to this problem can significantly enhance the precision of scene text recognition algorithms leading to a variety of applications such as scene understanding, navigation, automatic localization and image retrieval. In this paper, we propose a novel method to extract and binarize text as of images that contains complex background. We apply an Independent Component Analysis (ICA) based technique to map out the text region, which is uniform in nature, while removing specularity, shadows and reflections, which are included in the background. This algorithm works better on images with different degradations. We implement our method on various DIBCO datasets.

Authors and Affiliations

Varada Sreeja

Keywords

Related Articles

 MODIFIED AODV PROTOCOL FOR ENERGY EFFICIENT ROUTING IN MANET

 Mobile ad Hoc network is a collection of wireless mobile nodes that works without any fixed infrastructure. Mobile nodes in MANET are featured with limited battery power & performance of routing protocol degra...

 AN OVERVIEW OF CAR SPEED CONTROL USING BLUETOOTH AND SENSORS

 Imagine a world where billions of objects can sense, communicate and share information, all interconnected over public or private Internet Protocol (IP) networks. These interconnected objects have data regularly c...

 A PERSONALIZED SCHEME FOR INCOMPLETE AND DUPLICATE INFORMATION HANDLING IN RELATIONAL DATABASES

 Missing data replacement is a crucial process in most real world databases. Due to the tremendous improvement of data management, users of such database can effectively manage the incompleteness using their customi...

Estimation of Weibull Parameters In Accelerated Life Testing Using Geometric Process With Type-Ii Censored Data

In Accelerated life testing (ALT), generally, the log linear function between life and stress is used to obtain the estimates of original parameters of the life. The log linear is just a simple re-parameterization of th...

 WITRICITY: THE TECHNOLOGICAL MIRACLE OF WIRELESS ELECTRICITY TRANSFER

 The main theme of this paper is to transfer power wirelessly using the concept of highly resonant coupling. Witricity can make an amazing change by removing the use of conventional copper cables and current carryi...

Download PDF file
  • EP ID EP158692
  • DOI -
  • Views 78
  • Downloads 0

How To Cite

Varada Sreeja (30).  Document Image Binarization Using Independent Component Analysis For OCR. International Journal of Engineering Sciences & Research Technology, 3(9), 161-166. https://europub.co.uk/articles/-A-158692