Implementing Phylogenetic Distance Based Methods for Tree Construction Using Hierarchical Clustering
Journal Title: International Journal of Computer Science & Engineering Technology - Year 2013, Vol 4, Issue 7
Abstract
Bioinformatics is a data intensive field of research and development. Key problem of knowledge discovery from large and complex databases is deal problem data mining. It is used to discover relationships and patterns in large databases to provide useful information. Clustering is the one of the main techniques for data mining. Phylogeny is the evolutionary history for a set of evolutionary related species. Diagrams that display the phylogeny of a set of taxa in a tree like manner are called phylogenetic trees. One approach on determining the evolutionary histories of a dataset are distance based methods. There are number of different distance based methods of which two are dealt with here: the UPGMA (Unweighted Pair Group Method using Arithmetic average) and Neighbor Joining. These two are clustering based methods. A method for construction of distance based phylogenetic tree using hierarchical clustering is proposed and implemented on different Oryza sativa rice varieties. The sequences are downloaded from NCBI databank. Evolutionary distances are calculated using jukes cantor distance method. Multiple sequence alignment is applied on different datasets. Trees are constructed for different datasets from available data using both the distance based methods. Extractions of closely related varieties are performed by applying threshold condition. Then, final tree is constructed using these closely related varieties.
Authors and Affiliations
Archi Kataria , Dr. Amardeep Singh
An Approach of Zero Correlation Linear Cryptanalysis
Differential and Linear Cryptanalysis are two most popular techniques that have been widely used to attacks block ciphers to reveal its weakness in substitution and permutation network. Most of the block ciphers which ar...
A Study on Reliable Data Delivery for Highly Dynamic MANETs
This paper addresses the problem of delivering data packets in highly dynamic Mobile Ad Hoc network. Existing routing protocols are susceptible to node mobility. To overcome this issue, an efficient Position based Routin...
Performance Comparison of Analog and Digital Radio Over Fiber Link
In order to meet the ever increasing demand for larger transmission bandwidth, Wireless network based on radio-over-fiber technologies is a very beneficial solution. Various disadvantages of Analog Radio over Fiber link...
Operations on Signed Numbers
Signed integers are normally represented using 2’s complement representation. Addition and subtraction of signed numbers is done similar to that of unsigned numbers. However carry (or borrow) is simply ignored. Unlike un...
CLOUDCOMPUTING WITH BIG DATA AS A SERVICE
Bigdata management is becoming crucial now a days because of the evolution of large number of social networking websites,powerful mobile devices,sensors,and cloudcomputing.According to IDC analysis,the global data volume...