Genetic Programming for Document Segmentation and Region Classification Using Discipulus
Journal Title: International Journal of Advanced Research in Artificial Intelligence(IJARAI) - Year 2013, Vol 2, Issue 2
Abstract
Document segmentation is a method of rending the document into distinct regions. A document is an assortment of information and a standard mode of conveying information to others. Pursuance of data from documents involves ton of human effort, time intense and might severely prohibit the usage of data systems. So, automatic information pursuance from the document has become a big issue. It is been shown that document segmentation will facilitate to beat such problems. This paper proposes a new approach to segment and classify the document regions as text, image, drawings and table. Document image is divided into blocks using Run length smearing rule and features are extracted from every blocks. Discipulus tool has been used to construct the Genetic programming based classifier model and located 97.5% classification accuracy.
Authors and Affiliations
Priyadharshini N , Vijaya MS
A New Technique to Manage Big Bioinformatics Data Using Genetic Algorithms
The continuous growth of data, mainly the medical data at laboratories becomes very complex to use and to manage by using traditional ways. So, the researchers start studying genetic information field which increas...
Information-Theoretic Active SOM for Improving Generalization Performance
In this paper, we introduce a new type of information-theoretic method called “information-theoretic active SOM”, based on the self-organizing maps (SOM) for training multi-layered neural networks. The SOM is one o...
Brain Computer Interface Boulevard of Smarter Thoughts
The Brain Computer Interface is a major breakthrough for the technical industry, medical world, military and the society on a whole. It is concerned with the control of devices around us such as computing gears & eve...
Comparative Study on Cloud Parameter Estimation Among GOSAT/CAI, MODIS, CALIPSO/CALIOP and Landsat-8/OLI with Laser Radar: Lidar as Truth Data
A comparative study on cloud parameter estimation among GOSAT/CAI, MODIS, CALIPSO/CALIOP and Landsat-8/OLI is carried out using Laser Radar: Lidar as a truth data. Optical depth, size distribution, as well as cirru...
Category Decomposition Method for Un-Mixing of Mixels Acquired with Spaceborne Based Visible and Near Infrared Radiometers by Means of Maximum Entropy Method with Parameter Estimation Based on Simulated Annealing
Category decomposition method for un-mixing of mixels (Mixed Pixels) which is acquired with spaceborne based visible to near infrared radiometers by means of Maximum Entropy Method (MEM) with parameter estimation b...