A New Technique to Manage Big Bioinformatics Data Using Genetic Algorithms
Journal Title: International Journal of Advanced Research in Artificial Intelligence(IJARAI) - Year 2016, Vol 5, Issue 6
Abstract
The continuous growth of data, mainly the medical data at laboratories becomes very complex to use and to manage by using traditional ways. So, the researchers start studying genetic information field which increased in the past thirty years in bioinformatics domain (the computer science field, genetic biology field, and DNA). This growth of data becomes known as big bioinformatics data. Thus, efficient algorithms such as Genetic Algorithms are needed to deal with this big and vast amount of bioinformatics data in genetic laboratories. So the researchers proposed two models to manage the big bioinformatics data in addition to the traditional model. The first model by applying Genetic Algorithms before MapReduce, the second model by applying Genetic Algorithms after the MapReduce, and the original or the traditional model by applying only MapReduce without using Genetic Algorithms. The three models were implemented and evaluated using big bioinformatics data collected from the Duchenne Muscular Dystrophy (DMD) disorder. The researchers conclude that the second model is the best one among the three models in reducing the size of the data, in execution time, and in addition to the ability to manage and summarize big bioinformatics data. Finally by comparing the percentage errors of the second model with the first model and the traditional model, the researchers obtained the following results 1.136%, 10.227%, and 11.363% respectively. So the second model is the most accurate model with the less percentage error.
Authors and Affiliations
Huda Jalil Dikhil, Mohammad Shkoukani, Suhail Owais
Parameter Optimization for Nadaraya-Watson Kernel Regression Method with Small Samples
Many current regression algorithms have unsatisfactory prediction accuracy with small samples. To solve this problem, a regression algorithm based on Nadaraya-Watson kernel regression (NWKR) is proposed. The propos...
Introduction of the weight edition errors in the Levenshtein distance
In this paper, we present a new approach dedicated to correcting the spelling errors of the Arabic language. This approach corrects typographical errors like inserting, deleting, and permutation. Our method is inspired f...
An Interval-Based Context Reasoning Approach
Context-aware computing is an emerging computing paradigm that provides intelligent context-aware application. Context reasoning is an important aspect in context awareness, by which high level context can be derived fro...
Rice Crop Field Monitoring System with Radio Controlled Helicopter Based Near Infrared Cameras Through Nitrogen Content Estimation and Its Distribution Monitoring
Rice crop field monitoring system with radio controlled helicopter based near infrared cameras is proposed together with nitrogen content estimation method for monitoring its distribution in the field in concern. T...
Enhanced Tunneling Technique for Flow-Based Fast Handover in Proxy Mobile Ipv6 Networks
In the Mobile IPv6 network, each node is highly mobile and handoff is a very common process. When not processed efficiently, the handoff process may result in large amount of packet loss. If the handover process is...