Table Detection and Extraction from Image Document

Abstract

Tables make information easier to understand and perceive than regular text block. Now days, it  becomes popular structure for information representation. Format of tables differs and change according to need of representation of information. Various format of table makes it difficult for OCR system to recognize and just segment as an Image block. We proposed a novel approach which can detect all type of table format from single column image document. Tables are categorized in three type based of their rows and column separator.Type1 table have line as row and column separator. Type2 table have horizontal line for separating rows and space for separating column. In Type3 tables only space are used as both row and columns separator. Tables are detected from image documents based on simple projection profile and hough line detection method. We have tested this approach with 1200 image documents which contains all type of table format and get 89% accurate result.

Authors and Affiliations

Tanushree Dhiran , Rakesh Sharma

Keywords

Related Articles

 An Integrated Approach to Measurement Software Defect using Software Matrices

 Software measurement is a quantified attribute of a characteristic of a software product or the software process. It is a discipline within software engineering. Measurement programs in software organizations are a...

 SE code optimization using Data Mining Approach

 Data mining also holds promises for other software engineering processes, which have to deal with uncertainty and intangible data such as cost estimation, effort estimation and quality. It can also aid in interesti...

Image Steganography Using Dynamic LSB with Blowfish Algorithm

The past few years have seen an increasing interest in using images as cover media for  Steganography communication.The basic structure of Steganography is made up of three components: cover image, message, and the...

 Security requirements in Software Requirements Engineering

 In the last few decades, software projects have encountered major difficulties. Most software engineering projects tend to be late and over budget. Several of the causes of these failures are related to requirement...

 A Novel Encryption approach in Database Security

 Database servers are the most important thing in company environment. They store client details, financial information, human resource details and all the data are should be keptas very secret. Database security&nb...

Download PDF file
  • EP ID EP120818
  • DOI -
  • Views 114
  • Downloads 0

How To Cite

Tanushree Dhiran, Rakesh Sharma (2013). Table Detection and Extraction from Image Document. International Journal of Computer & organization Trends(IJCOT), 3(7), 275-278. https://europub.co.uk/articles/-A-120818