Multilingual Artificial Text Extraction and Script Identification from Video Images

Abstract

This work presents a system for extraction and script identification of multilingual artificial text appearing in video images. As opposed to most of the existing text extraction systems which target textual occurrences in a particular script or language, we have proposed a generic multilingual text extraction system that relies on a combination of unsupervised and supervised techniques. The unsupervised approach is based on application of image analysis techniques which exploit the contrast, alignment and geometrical properties of text and identify candidate text regions in an image. Potential text regions are then validated by an Artificial Neural Network (ANN) using a set of features computed from Gray Level Co-occurrence Matrices (GLCM). The script of the extracted text is finally identified using texture features based on Local Binary Patterns (LBP). The proposed system was evaluated on video images containing textual occurrences in five different languages including English, Urdu, Hindi, Chinese and Arabic. The promising results of the experimental evaluations validate the effectiveness of the proposed system for text extraction and script identification.

Authors and Affiliations

Akhtar Jamil, Azra Batool, Zumra Malik, Ali Mirza, Imran Siddiqi

Keywords

Related Articles

Local and Semi-Global Feature-Correlative Techniques for Face ?Recognition

Face recognition is an interesting field of computer vision with many commercial and ?scientific applications. It is considered as a very hot topic and challenging problem at the ?moment. Many methods and techniques have...

A Novel Assessment to Achieve Maximum Efficiency in Optimizing Software Failures

Software Reliability is a specialized area of software engineering which deals with the identification of failures while developing the software. Effective analysis of the reliability helps to signify the number of failu...

Haze Effects on Satellite Remote Sensing Imagery and their Corrections

Imagery recorded using satellite sensors operating at visible wavelengths can be contaminated by atmospheric haze that originates from large scale biomass burning. Such issue can reduce the reliability of the imagery and...

Fixed Point Implementation of Tiny-Yolo-v2 using OpenCL on FPGA

Deep Convolutional Neural Network (CNN) algorithm has recently gained popularity in many applications such as image classification, video analytic and object detection. Being compute-intensive and memory expensive, CNN-b...

The Impact of Privacy Concerns and Perceived Vulnerability to Risks on Users Privacy Protection Behaviors on SNS: A Structural Equation Model

This research paper investigates Saudi users’ awareness levels about privacy policies in Social Networking Sites (SNSs), their privacy concerns and their privacy protection measures. For this purpose, a research model th...

Download PDF file
  • EP ID EP149093
  • DOI 10.14569/IJACSA.2016.070469
  • Views 85
  • Downloads 0

How To Cite

Akhtar Jamil, Azra Batool, Zumra Malik, Ali Mirza, Imran Siddiqi (2016). Multilingual Artificial Text Extraction and Script Identification from Video Images. International Journal of Advanced Computer Science & Applications, 7(4), 529-539. https://europub.co.uk/articles/-A-149093