Tree-kNN: A Tree-Based Algorithm for Protein Sequence Classification

Journal Title: International Journal on Computer Science and Engineering - Year 2011, Vol 3, Issue 2

Abstract

The phylogenomic classification of protein sequences attempts to categorize a given protein within the evolutionary context of the entire family. It involves mainly four steps: selection of homologous sequences, multiple sequence alignment, phylogenetic tree construction and tree-based classification. This supposes that the tree used as a basis of protein classification is correct. Sequence alignment is the first step for tree construction. Thus, the accuracy of the alignment produced should affect the topology of the phylogenetic tree. This work proposes a kNN tree-based algorithm for protein classification, namely Tree-kNN, which uses a phylogenetic tree estimated from pair-wise and multiple alignment approaches. We compare the classification performance of Tree-kNN with an existing method, called TreeNN. Results show that Tree-kNN gives better results than TreeNN. Based on four datasets we show that classification performances of the two algorithms using pair-wise alignment are better than using multiple alignment

Authors and Affiliations

Khaddouja Boujenfa , Nadia Essoussi , Mohamed Limam

Keywords

Related Articles

AN ARTIFICIAL FISH SWARM OPTIMIZED FUZZY MRI IMAGE SEGMENTATION APPROACH FOR IMPROVING IDENTIFICATION OF BRAIN TUMOUR

In image processing, it is difficult to detect the abnormalities in brain especially in MRI brain images. Also the tumor segmentation from MRI image data is an important; however it is time consuming while carried out by...

A Study on Similarity Computations in Template Matching Technique for Identity Verification

This paper describes a study on the development of a human face verification system by merely using template matching (TM) as the main verification engine. In contrast to common face recognition techniques, our approach...

Generation of a pool of variable size symmetric keys through Image

This paper introduces a new concept of the generation of a unending pool of keys through an image leaving behind the idea of sending keys every time for encryption and decryption. This can help in avoiding the problem of...

Review on Binary Image Steganography and Watermarking

In this paper we have reviewed and analyzed different watermarking and steganography techniques. This is based on image processing in spatial and transform domain. We have reviewed different techniques like data hiding b...

Improved and Balanced LEACH for heterogeneous wireless sensor networks

While wireless sensor networks (WSN) is a power constrained system, since nodes run on limited power batteries which shorten its lifespan. Prolonging the network lifetime depends on efficient management of sensing node e...

Download PDF file
  • EP ID EP160512
  • DOI -
  • Views 144
  • Downloads 0

How To Cite

Khaddouja Boujenfa, Nadia Essoussi, Mohamed Limam (2011). Tree-kNN: A Tree-Based Algorithm for Protein Sequence Classification. International Journal on Computer Science and Engineering, 3(2), 961-968. https://europub.co.uk/articles/-A-160512