Implementing Phylogenetic Distance Based Methods for Tree Construction Using Hierarchical Clustering

Abstract

Bioinformatics is a data-intensive field of research and development. Data mining addresses the key problem of knowledge discovery from large and complex databases: it is used to discover relationships and patterns in large databases and so provide useful information. Clustering is one of the main techniques of data mining. A phylogeny is the evolutionary history of a set of evolutionarily related species. Diagrams that display the phylogeny of a set of taxa in a tree-like manner are called phylogenetic trees. One approach to determining the evolutionary history of a dataset is to use distance-based methods. There are a number of different distance-based methods, of which two are dealt with here: UPGMA (Unweighted Pair Group Method using Arithmetic average) and Neighbor Joining. Both are clustering-based methods. A method for constructing distance-based phylogenetic trees using hierarchical clustering is proposed and implemented on different Oryza sativa rice varieties. The sequences are downloaded from the NCBI databank. Evolutionary distances are calculated using the Jukes-Cantor distance method. Multiple sequence alignment is applied to the different datasets. Trees are constructed for the different datasets from the available data using both distance-based methods. Closely related varieties are then extracted by applying a threshold condition, and a final tree is constructed from these closely related varieties.
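The pipeline the abstract describes (Jukes-Cantor distances, UPGMA clustering, threshold-based extraction of close varieties) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the toy sequences, and the threshold value of 0.25 are assumptions made for illustration, the sequences are taken to be already aligned, and Neighbor Joining is not shown.

```python
import math
from itertools import combinations


def jukes_cantor(seq_a, seq_b):
    """Jukes-Cantor distance d = -(3/4) * ln(1 - (4/3) * p) for two aligned
    sequences, where p is the fraction of differing ungapped sites."""
    sites = [(a, b) for a, b in zip(seq_a, seq_b) if a != "-" and b != "-"]
    if not sites:
        raise ValueError("no ungapped aligned sites to compare")
    p = sum(a != b for a, b in sites) / len(sites)
    if p >= 0.75:
        return math.inf  # saturated: the correction is undefined
    return -0.75 * math.log(1 - 4 * p / 3)


def upgma(names, dist):
    """Average-linkage (UPGMA) clustering.

    dist maps frozenset({a, b}) to a distance. Returns the tree topology
    as a Newick-style string without branch lengths."""
    clusters = {name: 1 for name in names}  # cluster label -> number of leaves
    d = dict(dist)
    while len(clusters) > 1:
        # Merge the closest pair of current clusters.
        a, b = min(combinations(clusters, 2), key=lambda pr: d[frozenset(pr)])
        size_a, size_b = clusters.pop(a), clusters.pop(b)
        merged = f"({a},{b})"
        # Distance to the new cluster is the size-weighted mean of the old ones.
        for c in clusters:
            d[frozenset((merged, c))] = (
                size_a * d[frozenset((a, c))] + size_b * d[frozenset((b, c))]
            ) / (size_a + size_b)
        clusters[merged] = size_a + size_b
    return next(iter(clusters)) + ";"


def close_pairs(names, dist, threshold):
    """Pairs of taxa whose distance does not exceed the threshold."""
    return [
        (a, b)
        for a, b in combinations(names, 2)
        if dist[frozenset((a, b))] <= threshold
    ]


# Hypothetical aligned toy sequences (not real Oryza sativa data).
seqs = {
    "A": "ACGTACGTAC",
    "B": "ACGTACGTAT",
    "C": "ACGTTCGAAT",
    "D": "TCGATCGAAT",
}
names = list(seqs)
dist = {
    frozenset((a, b)): jukes_cantor(seqs[a], seqs[b])
    for a, b in combinations(names, 2)
}
tree = upgma(names, dist)
related = close_pairs(names, dist, threshold=0.25)  # threshold is an assumed value
```

On these toy sequences, `tree` is `((A,B),(C,D));` and `related` holds the pairs A-B, B-C and C-D. The dictionary keyed by `frozenset` keeps the distance lookup symmetric without storing a full matrix; for real datasets a library implementation of hierarchical clustering would be the more practical choice.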

Authors and Affiliations

Archi Kataria, Dr. Amardeep Singh

How To Cite

Archi Kataria, Dr. Amardeep Singh (2013). Implementing Phylogenetic Distance Based Methods for Tree Construction Using Hierarchical Clustering. International Journal of Computer Science & Engineering Technology, 4(7), 890-901. https://europub.co.uk/articles/-A-125637