Implementing Phylogenetic Distance Based Methods for Tree Construction Using Hierarchical Clustering
Journal Title: International Journal of Computer Science & Engineering Technology - Year 2013, Vol 4, Issue 7
Abstract
Bioinformatics is a data intensive field of research and development. Key problem of knowledge discovery from large and complex databases is deal problem data mining. It is used to discover relationships and patterns in large databases to provide useful information. Clustering is the one of the main techniques for data mining. Phylogeny is the evolutionary history for a set of evolutionary related species. Diagrams that display the phylogeny of a set of taxa in a tree like manner are called phylogenetic trees. One approach on determining the evolutionary histories of a dataset are distance based methods. There are number of different distance based methods of which two are dealt with here: the UPGMA (Unweighted Pair Group Method using Arithmetic average) and Neighbor Joining. These two are clustering based methods. A method for construction of distance based phylogenetic tree using hierarchical clustering is proposed and implemented on different Oryza sativa rice varieties. The sequences are downloaded from NCBI databank. Evolutionary distances are calculated using jukes cantor distance method. Multiple sequence alignment is applied on different datasets. Trees are constructed for different datasets from available data using both the distance based methods. Extractions of closely related varieties are performed by applying threshold condition. Then, final tree is constructed using these closely related varieties.
Authors and Affiliations
Archi Kataria , Dr. Amardeep Singh
COMPREHENSIVE STUDY AND COMPARISON OF INFORMATION DISPERSAL TECHNIQUES FOR CLOUD COMPUTING
Cloud systems refer to the collection of interconnected servers that are provisioned dynamically on demand, for execution of applications, to the customer like electricity grid. Cloud computing has gained great attention...
Comparative Study on Text Pattern Matching for Heterogeneous System
Pattern-matching has been routinely used in various computer applications, for example, in editors, retrieval of information either textual, image, or sound and searching nucleotide or amino acid sequence patterns in gen...
Applications of ANNs in Stock Market Prediction: A Survey
This paper surveys recent literature in the domain of machine learning techniques and artificial intelligence used to predict stock market movements. Artificial Neural Networks (ANNs) are identified to be the dominant ma...
THE IMPLEMENTATION OF PROGRAMMABLE WIRELESS ROUTER USING JAVA
Nowadays available wireless routers do have extra levels of embedded security. Existing Wireless routers can be configured for invisible mode. So that wireless network cannot be scanned by outside wireless clients. If th...
Concerto of Remedy Network Dais on Side Effects amongst Claim for Remedy Entrust
Remedies with similar side-effect silhouette may share comparable beneficial assets from beginning to end interrelated apparatus of deed. In this study, a remedy-remedy intricate was erect pedestal on the resemblance ami...