Document Image Binarization Using Independent Component Analysis For OCR

Abstract

Image binarization plays a vital role in text segmentation, a key step in OCR applications. Binarizing text in degraded images is challenging because the text varies in size, color, and font, and the results are often affected by complex backgrounds, uneven lighting, reflections, and shadows. A robust solution to this problem can significantly improve the accuracy of scene text recognition algorithms, enabling applications such as scene understanding, navigation, automatic localization, and image retrieval. In this paper, we propose a novel method to extract and binarize text from images that contain complex backgrounds. We apply an Independent Component Analysis (ICA) based technique to map out the text region, which is uniform in nature, while removing the specularity, shadows, and reflections that belong to the background. The algorithm performs well on images with different types of degradation. We apply our method to various DIBCO datasets.
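
The abstract does not give the details of the ICA procedure, so the following is only a minimal sketch of the general idea and not the method of the paper. It assumes that two observed channels are linear mixtures of a sparse text layer and a smooth illumination layer. A kurtosis-based rotation search after whitening stands in for a full ICA algorithm, and Otsu thresholding of the recovered text component is an added assumption. All function names and the synthetic mixing weights are hypothetical.

```python
import math

def rotate(z1, z2, t):
    """Rotate the paired samples (z1, z2) by angle t."""
    ct, st = math.cos(t), math.sin(t)
    return ([ct * u + st * v for u, v in zip(z1, z2)],
            [-st * u + ct * v for u, v in zip(z1, z2)])

def whiten(x1, x2):
    """Center two channels, rotate to principal axes, scale to unit variance."""
    n = len(x1)
    m1, m2 = sum(x1) / n, sum(x2) / n
    c1, c2 = [v - m1 for v in x1], [v - m2 for v in x2]
    a = sum(u * u for u in c1) / n
    c = sum(v * v for v in c2) / n
    b = sum(u * v for u, v in zip(c1, c2)) / n
    t = 0.5 * math.atan2(2 * b, a - c)  # diagonalizes the 2x2 covariance
    p1, p2 = rotate(c1, c2, t)
    s1 = math.sqrt(sum(u * u for u in p1) / n)
    s2 = math.sqrt(sum(v * v for v in p2) / n)
    return [u / s1 for u in p1], [v / s2 for v in p2]

def kurtosis(y):
    """Excess kurtosis of a zero-mean, unit-variance signal."""
    return sum(v ** 4 for v in y) / len(y) - 3.0

def ica_2d(z1, z2, steps=180):
    """Grid-search the rotation that maximizes the summed squared kurtosis."""
    best = None
    for k in range(steps):
        y1, y2 = rotate(z1, z2, (math.pi / 2) * k / steps)
        score = kurtosis(y1) ** 2 + kurtosis(y2) ** 2
        if best is None or score > best[0]:
            best = (score, y1, y2)
    return best[1], best[2]

def otsu_threshold(values, bins=64):
    """Otsu's threshold computed on a histogram of the given values."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / bins or 1.0
    hist = [0] * bins
    for v in values:
        hist[min(int((v - lo) / width), bins - 1)] += 1
    total = len(values)
    sum_all = sum(i * h for i, h in enumerate(hist))
    best_var, best_i, w0, sum0 = -1.0, 0, 0, 0.0
    for i in range(bins - 1):
        w0 += hist[i]
        sum0 += i * hist[i]
        w1 = total - w0
        if w0 == 0 or w1 == 0:
            continue
        var = w0 * w1 * (sum0 / w0 - (sum_all - sum0) / w1) ** 2
        if var > best_var:
            best_var, best_i = var, i
    return lo + (best_i + 1) * width

def binarize_ica(ch1, ch2):
    """Return a 0/1 list in which 1 marks pixels assigned to the text layer."""
    y1, y2 = ica_2d(*whiten(ch1, ch2))
    # Assumption: sparse text is the more super-Gaussian component.
    text = y1 if kurtosis(y1) >= kurtosis(y2) else y2
    thr = otsu_threshold(text)
    above = [1 if v >= thr else 0 for v in text]
    if sum(above) * 2 > len(above):  # assumption: text is the minority class
        above = [1 - b for b in above]
    return above

def make_synthetic(h=20, w=30, text_rows=(5, 12)):
    """Hypothetical test image: dark horizontal strokes on a lighting gradient."""
    ch1, ch2, truth = [], [], []
    for y in range(h):
        for x in range(w):
            mask = 1 if y in text_rows else 0
            bg = x / (w - 1)  # illumination rises from left to right
            ch1.append(0.7 * bg - 0.5 * mask + 0.3)
            ch2.append(0.6 * bg - 0.3 * mask + 0.3)
            truth.append(mask)
    return ch1, ch2, truth

ch1, ch2, truth = make_synthetic()
binary = binarize_ica(ch1, ch2)
```

In this synthetic image the text and background intensity ranges overlap in both channels, so a single global threshold on either channel would mislabel some pixels; separating the layers first removes the gradient before thresholding. Real degraded documents will not follow a linear two-source model exactly, so the sketch is meant only to illustrate the principle.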

Authors and Affiliations

Varada Sreeja

How To Cite

Varada Sreeja (30).  Document Image Binarization Using Independent Component Analysis For OCR. International Journal of Engineering Sciences & Research Technology, 3(9), 161-166. https://europub.co.uk/articles/-A-158692