Genetic Programming for Document Segmentation and Region Classification Using Discipulus

Abstract

Document segmentation is the process of dividing a document into distinct regions. A document is a collection of information and a standard means of conveying information to others. Extracting information from documents manually takes considerable human effort and time, and can severely limit the use of information systems. Automatic information extraction from documents has therefore become an important problem, and document segmentation has been shown to help address it. This paper proposes a new approach to segment document regions and classify them as text, image, drawing or table. The document image is divided into blocks using the run-length smearing algorithm, and features are extracted from each block. The Discipulus tool is used to construct a genetic programming based classifier model, which achieved 97.5% classification accuracy.
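
As an illustration of the block-forming step, the following is a minimal sketch of the classic run-length smearing idea, not the authors' implementation. The abstract does not state thresholds or the exact variant used, so the function names `smear_row` and `rlsa`, the threshold parameters, and the choice to leave white runs at the image border untouched are all assumptions made for this example. The image is assumed to be a list of rows with 1 for black (ink) and 0 for white.

```python
def smear_row(row, threshold):
    """Fill white (0) runs of length <= threshold lying between two black (1) pixels.

    Assumption: white runs touching the start or end of the row are left as-is.
    """
    out = list(row)
    last_black = None
    for i, pixel in enumerate(row):
        if pixel == 1:
            if last_black is not None:
                gap = i - last_black - 1
                if 0 < gap <= threshold:
                    for j in range(last_black + 1, i):
                        out[j] = 1
            last_black = i
    return out


def rlsa(image, h_threshold, v_threshold):
    """Smear rows and columns independently, then combine with a logical AND.

    h_threshold and v_threshold are hypothetical parameters; the abstract
    does not report the values used.
    """
    # Horizontal pass: smear each row.
    horizontal = [smear_row(r, h_threshold) for r in image]
    # Vertical pass: transpose, smear each column as a row, transpose back.
    columns = [list(c) for c in zip(*image)]
    smeared_cols = [smear_row(c, v_threshold) for c in columns]
    vertical = [list(r) for r in zip(*smeared_cols)]
    # A pixel stays black only if both passes marked it black.
    return [[h & v for h, v in zip(hr, vr)]
            for hr, vr in zip(horizontal, vertical)]
```

Connected black areas in the smeared image would then serve as candidate blocks for feature extraction and classification; how the blocks are labelled and which features are computed is not specified in the abstract.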

Authors and Affiliations

Priyadharshini N, Vijaya MS

How To Cite

Priyadharshini N, Vijaya MS (2013). Genetic Programming for Document Segmentation and Region Classification Using Discipulus. International Journal of Advanced Research in Artificial Intelligence (IJARAI), 2(2), 15-22. https://europub.co.uk/articles/-A-109131