Tree-kNN: A Tree-Based Algorithm for Protein Sequence Classification

Journal Title: International Journal on Computer Science and Engineering - Year 2011, Vol 3, Issue 2

Abstract

The phylogenomic classification of protein sequences attempts to categorize a given protein within the evolutionary context of the entire family. It involves mainly four steps: selection of homologous sequences, multiple sequence alignment, phylogenetic tree construction and tree-based classification. This supposes that the tree used as a basis of protein classification is correct. Sequence alignment is the first step for tree construction. Thus, the accuracy of the alignment produced should affect the topology of the phylogenetic tree. This work proposes a kNN tree-based algorithm for protein classification, namely Tree-kNN, which uses a phylogenetic tree estimated from pair-wise and multiple alignment approaches. We compare the classification performance of Tree-kNN with an existing method, called TreeNN. Results show that Tree-kNN gives better results than TreeNN. Based on four datasets we show that classification performances of the two algorithms using pair-wise alignment are better than using multiple alignment

Authors and Affiliations

Khaddouja Boujenfa , Nadia Essoussi , Mohamed Limam

Keywords

Related Articles

PERFORMANCE OF MULTI SERVER AUTHENTICATION AND KEY AGREEMENT WITH USER PROTECTION IN NETWORK SECURITY

Using smart cards, remote user authentication and key greement can be simplified, flexible, and efficient for creating a secure distributed computers environment. Addition to user authentication and key distribution, it...

An Efficient Data Link Protocol for Integrated Wireless Networks: Next Generation Networks (NGN)

The seamless integrated communication has a vital role in pervasive communication. It has implemented on next generation networking. There is various integration issues: coupling, decoupling, mobility, IP etc. In this pa...

Solving Sparse Rating Problem Using Fine Grained Approach

Recommender System is a system that automatically recommends all similar kind of items that are of user interest. In design of the recommender systems rating is the crucial issue. Till today many algorithms have been pro...

IROBOT CREATE: PROGRAMMED AS PROVIDER

This paper elucidates the research and implementation of guiding robots to a predetermined path. The robot is programmed through an 8 bit micro controller. The input to the robot for a particular path can be given by pre...

A Scheduling Approach with Processor and Network Heterogeneity for Grid Environment

Processor heterogeneity is an important issue in grid environment. In this paper, a list based task scheduling algorithm, called “critical path scheduling with t-level” (CPST) for grid computing system is proposed. There...

Download PDF file
  • EP ID EP160512
  • DOI -
  • Views 153
  • Downloads 0

How To Cite

Khaddouja Boujenfa, Nadia Essoussi, Mohamed Limam (2011). Tree-kNN: A Tree-Based Algorithm for Protein Sequence Classification. International Journal on Computer Science and Engineering, 3(2), 961-968. https://europub.co.uk/articles/-A-160512