An Implementation of Sequential Rule Mining Using Mapreduce Based Genetic Algorithm
Journal Title: Scholars Journal of Engineering and Technology - Year 2017, Vol 5, Issue 7
Abstract
Sequential rule mining is a fundamental data mining technique with many applications, one of which is in bioinformatics. Bioinformatics applies information technology to store very large volumes of biological data in an organised way so that the data can be analysed easily. These biological data take the form of protein sequences, and sequence analysis is one of the major research areas in bioinformatics. Bioinformatics datasets are incremental, and in the present era of cheap information technology they have become big data. Efficient mining of big data requires parallel as well as iterative techniques. Here we propose a technique to analyse these sequential data and extract sequential rules using the MapReduce framework of Hadoop combined with a genetic algorithm. MapReduce is responsible for parallelising the mining technique, whereas the genetic algorithm works iteratively to generate sequential rules from DNA sequences.
Keywords: Data mining, rule mining, Big data, Hadoop, MapReduce, Genetic Algorithm.
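The division of labour described in the abstract can be illustrated with a minimal single-process Python sketch. It is not the authors' implementation and does not use Hadoop: the map and reduce steps are plain functions applied to in-memory data splits. The rule form (a short antecedent substring followed later by a consequent substring), the fitness (support times confidence), and all function names are assumptions made for illustration only.

```python
import random
from collections import Counter
from functools import reduce

ALPHABET = "ACGT"  # assumed DNA alphabet

def matches(seq, rule):
    """Return (antecedent_found, rule_found) for one sequence."""
    ante, cons = rule
    i = seq.find(ante)
    if i < 0:
        return False, False
    # The consequent must occur after the first antecedent match ends.
    return True, seq.find(cons, i + len(ante)) >= 0

def map_partition(partition, rules):
    """Map step: count antecedent and full-rule matches in one data split."""
    counts = Counter()
    for seq in partition:
        for rule in rules:
            ante_hit, rule_hit = matches(seq, rule)
            counts[(rule, "ante")] += ante_hit
            counts[(rule, "rule")] += rule_hit
    return counts

def reduce_counts(a, b):
    """Reduce step: merge the partial counts of two splits."""
    return a + b

def evaluate(partitions, rules):
    """Score each distinct rule as support * confidence (assumed fitness)."""
    unique = list(dict.fromkeys(rules))  # avoid double counting duplicates
    total = reduce(reduce_counts,
                   (map_partition(p, unique) for p in partitions), Counter())
    n = sum(len(p) for p in partitions)
    scores = {}
    for rule in unique:
        ante, both = total[(rule, "ante")], total[(rule, "rule")]
        support = both / n if n else 0.0
        confidence = both / ante if ante else 0.0
        scores[rule] = support * confidence
    return scores

def random_rule(rng, k=2):
    """Draw a random rule whose two sides are k bases long."""
    def pick():
        return "".join(rng.choice(ALPHABET) for _ in range(k))
    return pick(), pick()

def crossover(r1, r2):
    """Combine the antecedent of one parent with the consequent of another."""
    return r1[0], r2[1]

def mutate(rule, rng):
    """Replace one base on a randomly chosen side of the rule."""
    parts = [list(rule[0]), list(rule[1])]
    side = rng.randrange(2)
    parts[side][rng.randrange(len(parts[side]))] = rng.choice(ALPHABET)
    return "".join(parts[0]), "".join(parts[1])

def mine_rules(partitions, generations=20, pop_size=20, seed=0):
    """Iterative GA loop; every generation is scored by one map/reduce pass."""
    rng = random.Random(seed)
    population = [random_rule(rng) for _ in range(pop_size)]
    for _ in range(generations):
        scores = evaluate(partitions, population)
        ranked = sorted(scores, key=scores.get, reverse=True)
        parents = ranked[: max(2, pop_size // 2)]  # elitist selection
        children = [mutate(crossover(rng.choice(parents), rng.choice(parents)), rng)
                    for _ in range(pop_size - len(parents))]
        population = parents + children
    scores = evaluate(partitions, population)
    best = max(scores, key=scores.get)
    return best, scores[best]

if __name__ == "__main__":
    # Toy data standing in for two splits of a sequence dataset.
    splits = [["ACGTACGT", "ACTTGGAC"], ["GGGTACGT", "ACGTTTGT"]]
    print(mine_rules(splits))
```

In a real Hadoop deployment the map and reduce functions would run as distributed tasks over file splits, while the genetic loop would drive one job per generation; the sketch only mirrors that structure in memory.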
Authors and Affiliations
Amanjeet Kour, Om Prakash Dewangan, Toran Verma
Assessment of water quality of river Yamuna in Yamunanagar, India with reference to planktons and macrozoobenthos
This paper presents an assessment of the water quality of the river Yamuna where it meanders along the city of Yamunanagar, India, and is subjected to sewage and industrial pollution. The analysis of various pollution parameters...
Classification and Blocking of Spam Users based on Review Using Expected Maximization Algorithm
Online shopping sites, where people share their reviews of products and their shopping experience, are an excellent source for collecting reviews on a specific product. People may come through the wrong opinions kn...
Localization Techniques in WSN: A Review
A Wireless Sensor Network (WSN) is a kind of wireless network composed of groups of very small devices known as ‘Sensor Nodes’. It is one of the fastest-evolving fields. Applications of WSN comprise search, di...
Low-cost Brain-Computer Interfaces
This article reviews and establishes the current state of research and technology for low-cost, portable and easy-to-use Brain-Computer Interfaces (BCIs) suitable for non-medical applications such as communication, environ...
A Linear Programming Model of Multi-class Support Vector Machine
The structure of the K-SVCR algorithm is ‘one-against-one-against-rest’. Its advantage is that each decomposition makes full use of the information in all training points. To a certain extent...