An Effective Identification of Species from DNA Sequence: A Classification Technique by Integrating DM and ANN

Abstract

Species classification from DNA sequences remains as an open challenge in the area of bioinformatics, which deals with the collection, processing and analysis of DNA and proteomic sequence. Though incorporation of data mining can guide the process to perform well, poor definition, and heterogeneous nature of gene sequence remains as a barrier. In this paper, an effective classification technique to identify the organism from its gene sequence is proposed. The proposed integrated technique is mainly based on pattern mining and neural network-based classification. In pattern mining, the technique mines nucleotide patterns and their support from selected DNA sequence. The high dimension of the mined dataset is reduced using Multilinear Principal Component Analysis (MPCA). In classification, a well-trained neural network classifies the selected gene sequence and so the organism is identified even from a part of the sequence. The proposed technique is evaluated by performing 10-fold cross validation, a statistical validation measure, and the obtained results prove the efficacy of the technique.

Authors and Affiliations

Sathish S, Dr. N. Duraipandian

Keywords

Related Articles

Construction of FuzzyFind Dictionary using Golay Coding Transformation for Searching Applications

searching through a large volume of data is very critical for companies, scientists, and searching engines applications due to time complexity and memory complexity. In this paper, a new technique of generating FuzzyFind...

Review of Energy Reduction Techniques for Green Cloud Computing

The growth of cloud computing has led to uneconomical energy consumption in data processing, storage, and communications. This is unfriendly to the environment, because of the carbon emissions. Therefore, green IT is req...

Management Information Systems in Public Institutions in Jordan

Six constructs were utilized in this study to explore the factors affecting MIS implementation in Jordanian public institutions and to investigate the impact of MIS implementation on organizational (operational) performa...

Detection and Isolation of Packet Dropping Attacker in MANETs

Several approaches have been proposed for Intrusion Detection Systems (IDS) in Mobile Ad hoc Networks (MANETs). Due to lack of MANETs infrastructure and well defined perimeter MANETs are susceptible to a variety of attac...

 An Effective Reasoning Algorithm for Question Answering System

 Knowledge representation (KR) is the most desirable area of research to make the system intelligent. Today is the era of knowledge that requires articulations, semantic, syntax etc. These requirements, forced to de...

Download PDF file
  • EP ID EP145799
  • DOI -
  • Views 111
  • Downloads 0

How To Cite

Sathish S, Dr. N. Duraipandian (2012). An Effective Identification of Species from DNA Sequence: A Classification Technique by Integrating DM and ANN. International Journal of Advanced Computer Science & Applications, 3(8), 104-114. https://europub.co.uk/articles/-A-145799