Paralyzing Bioinformatics Applications Using Conducive Hadoop Cluster
Journal Title: IOSR Journals (IOSR Journal of Computer Engineering) - Year 2013, Vol 14, Issue 6
Abstract
Bioinformatics may be defined as the application of computer science to molecular biology in the form of statistics and analytics. The bioinformatics applications deal with bulk amount of data. Researchers are now facing problems with the analysis of such ultra large-scale data sets, a problem that will only increase at an alarming rate in coming years. More over big challenge is involved in processing, storing and analyzing these peta bytes of data without causing much delay. Most of the bioinformatics algorithms are sequential thus making situation rather worse. This implies that data manipulations by means of uniprocessor systems are impractical. However most of the biological problems have parallel nature. Hence a practical and effective approach involves the usage of parallel clusters of workstations. Hadoop can be used to tackle this class of problems with good performance and scalability. This technology could be the basis of a computational parallel platform for several problems in the context of bioinformatics applications. Normally, Hadoop is deployed over high performance computing systems which are expensive involving complex deployment scenarios that only big enterprises are able to make it possible. So for smaller research organizations where cost is an important factor cannot choose systems with high computational capabilities for cluster set up. Rocks cluster is a viable solution in such scenarios. Rocks Cluster Distribution originally called NPACI Rocks is a Linux distribution intended for high-performance computing clusters. This paper implements a cost-effective cluster for paralyzing bioinformatics applications by deploying Hadoop over rock cluster and Emphasizes on the usage of commodity clusters for paralyzing bioinformatics applications by providing necessary justifications. Results show that paralyzing bioinformatics application saves much time compared to stand alone mode of execution effectively under optimal cost considerations.
Authors and Affiliations
Bincy P Andrews
Efficient Construction of Dictionary using Directed Acyclic Word Graph
Abstract: Implementation of dictionary is a topic on which research is going on since a long time in the search of a better and efficient algorithm both in terms of space and time complexity. Its necessity has increased...
An Improved Simulation Model for Rayleigh Fading Channels
Abstract:the model of propagation of electromagnetic energy from transmitter to receiver will be largely by way of scatting, either by reflection from the flat sides of buildings or by diffraction around such buildings o...
Internal & External Attacks in cloud computing Environmentfrom confidentiality, integrity and availability points of view
Abstract: Cloud computing is set of resources and services offered through the Internet. Cloud services aredelivered from data centers located throughout the world. Cloud computing facilitates its consumers byprovi...
Performance Measurement of WLAN Based On Medium Access Control for Wirelessly Connected Stations
Abstract: This paper is mainly focuses on the Medium Access Control (MAC) sublayer of the IEEE 802.11 standard for Wireless Local Area Network (WLAN) and delay measurement among the network and also compare of the traffi...
The Effects of Gender on the Economic Status and Social Interaction of Hiv/Aids Infected Youth in Kamptembwo Location, Nakuru County
Abstract: Human immunodeficiency virus (HIV) is a virus that damages cells of the body’s immune system. Acquired immunodeficiency syndrome is a collection of symptoms and infections resulting from damages causedby HIV in...