A Novel Technique for Parallelization of Genetic Algorithm using Hadoop
Journal Title: INTERNATIONAL JOURNAL OF ENGINEERING TRENDS AND TECHNOLOGY - Year 2013, Vol 4, Issue 8
Abstract
Document categorization is used in education, government sectors, art, industry etc. Categorizing a document to enable immediate finding of it in the future motivated the concept of Classification involving Document categorization. Manual document classification involves a lot of effort and is time consuming. The basic idea implemented in this paper speeds up processing and reduces manual intervention, by atomizing this categorization. This idea is an edge over the existing classification systems. The implementation of the system basically involves getting into to parallelize Genetic Algorithm (GA) thus improving the processing speed. The use of Hadoop MapReduce and HDFS (Hadoop Distributed File System) framework helps to store big data and speeds up the calculations involved in the computation of genetic algorithm. The motivation of this work has reason from mapreduce fare well in terms of scalability, fault tolerance, and ease-of-use. This is adjoined by hadoop being an open-source and hadoop being written in Java
Authors and Affiliations
Ms. Kanchan Sharadchandra Rahate (Khedikar)#1 Prof. L. M. R. J. Lobo
Survey of Watermarking Algorithms For Medical Images
Watermarking is a branch of information hiding which is used to hide proprietary information in digital media like photographs, digital music, or digital video. And also which has seen a lot of research interest...
An Evolutionary Approach for Optimal Citing and Sizing of Micro-Grid in Radial Distribution Systems
This Paper presents the methodology of penetration of Micro-Grids (MG) in the radial distribution system (RDS). The aim of this paper is to minimize a total real power loss that descends the performance of the radial dis...
A Novel Technique for Parallelization of Genetic Algorithm using Hadoop
Document categorization is used in education, government sectors, art, industry etc. Categorizing a document to enable immediate finding of it in the future motivated the concept of Classification involving Documen...
An Empirical Data Cleaning Technique for CFDs
Data cleaning is a basic data preprocessing technique for before forwarding the data to data mining approach ,but it leads to an intresting research area in the field of data mining. Data cleaning is the process of...
A Study on Modeling Standards for Web Applications and Significance of AspectWebML
Standard UML fails to model web applications effectively due to their complex and dynamic behavior. For better modeling, UML was extended with UWE and also WebML, which fully supports OOP principles, was evolved. A...