Tree-kNN: A Tree-Based Algorithm for Protein Sequence Classification

Journal Title: International Journal on Computer Science and Engineering - Year 2011, Vol 3, Issue 2

Abstract

The phylogenomic classification of protein sequences attempts to categorize a given protein within the evolutionary context of the entire family. It involves mainly four steps: selection of homologous sequences, multiple sequence alignment, phylogenetic tree construction and tree-based classification. This supposes that the tree used as a basis of protein classification is correct. Sequence alignment is the first step for tree construction. Thus, the accuracy of the alignment produced should affect the topology of the phylogenetic tree. This work proposes a kNN tree-based algorithm for protein classification, namely Tree-kNN, which uses a phylogenetic tree estimated from pair-wise and multiple alignment approaches. We compare the classification performance of Tree-kNN with an existing method, called TreeNN. Results show that Tree-kNN gives better results than TreeNN. Based on four datasets we show that classification performances of the two algorithms using pair-wise alignment are better than using multiple alignment

Authors and Affiliations

Khaddouja Boujenfa , Nadia Essoussi , Mohamed Limam

Keywords

Related Articles

Mathematical algorithms for determination of mixed layer height from laser radar signals

This paper describes different mathematical algorithms used in the determination of mixed layer height (MLH) from the laser radar (lidar) signals. These methods are successfully applied to the indigenously developed port...

The K-Means Clustering used in Wireless Sensor Network

The past few years have witnessed increased interest in the potential use of wireless sensor networks in applications such as environment management and various surveillance. The Sensor nodes in these applications are ex...

SUBJECTIVE CONTENT ACCESSIBILITY USING DATABASE APPROACH FOR DIGITAL LIBRARY

Today’s digital library is a massive collection of various types and categories of documents. The existing search engines do not provide subjective search from the collection, as no information about context is stored. T...

Meta-Content framework for back index generation

Book reading is a common thing which every one of us does in our life. A common strategy to spot a page for reading is to use front index and back index. A front index generally contains the sections and subsections topi...

Design and Development of Wireless Sensor Node

This paper presents design and development of intelligent sensor node for environmental monitoring. The node is equipped with multimode sensors for sensing different environmental parameters, the node can sense four diff...

Download PDF file
  • EP ID EP160512
  • DOI -
  • Views 121
  • Downloads 0

How To Cite

Khaddouja Boujenfa, Nadia Essoussi, Mohamed Limam (2011). Tree-kNN: A Tree-Based Algorithm for Protein Sequence Classification. International Journal on Computer Science and Engineering, 3(2), 961-968. https://europub.co.uk/articles/-A-160512