A Hybrid Approach for Complex Layout Detection of Newspapers in Gurumukhi Script Using Deep Learning
Journal Title: International Journal of Experimental Research and Review - Year 2023, Vol 35, Issue 6
Abstract
Layout analysis is the crucial stage in the recognition system of newspapers. A good layout analysis results in better recognition results. The complexity of newspaper layout structures poses a formidable challenge in digitization. The intricate arrangement of text, images, and various sections within a newspaper demands sophisticated algorithms and techniques for accurate layout detection. The paper introduces a diverse set of methodologies from existing literature, highlighting the evolution of techniques for newspaper layout analysis. In this paper, we present a novel method to detect the complex layout of newspapers in the Gurumukhi script by using a hybrid approach. The method developed consists of two parts. In the first part, we proposed an algorithm to remove pictures and graphics from Punjabi newspaper images that involve various image preprocessing tasks based on binarization, finding contours, and erosion on the image to remove the graphics from the image. This method removes pictures from complex non-Manhattan layouts. We have tested this algorithm on 100 newspaper images, giving an accuracy of 96.22%. In the second part, a dataset of 500 newspapers was created with images labeled with five classes on which the model was trained. Finally, we have trained the deep-leaning model based on a convolutional network to detect the columns of text in newspapers. We have used four different architectures of CNN and compared their performance based on different metrics such as precision, recall, and F1 score. We have tested this method on a number of newspapers in the Gurumukhi script. We have achieved an accuracy of 95.53% with this approach.
Authors and Affiliations
Atul Kumar, Gurpreet Singh Lehal
Implications of Cyber-Physical Adversarial Attacks on Autonomous Systems
This study examines hostile cyber-physical assaults on autonomous systems and proposes a novel approach. The recommended strategy integrates many domains, evaluates data quantitatively, and emphasizes real-world applicat...
Identifying and Ranking Critical Motivational Dimensions for the Choice of Wellness Tourism: An Analytic Hierarchy Process (AHP) Approach
This exploratory study aims to determine the elements that impact travellers' decisions to choose wellness tourism. Motivation criteria have been prioritized to determine the most influential reason tourists decide to en...
Effects of oral contraceptive pill on female health
Oral contraceptives, also known as birth control tablets/pills, are medicines that stop pregnancy. About 75% of married women who use contraception in the research location (Purba Medinipur district) reported they prefer...
Securing the Data Using an Efficient Machine Learning Technique
More accessible data and the rise of advanced data analysis contribute to using complex models in decision-making across various fields. Nevertheless, protecting people’s privacy is vital. Medical predictions often emplo...
Performance Analysis of Millimeter-Wave Propagation Characteristics for Various Channel Models in the Indoor Environment
Due to the recent surge in the proliferation of smart wireless devices that feature higher data speeds, there has been a rise in demand for faster indoor data communication services. Moreover, there is a sharp increase i...