Applying Back Propagation Algorithm for classification of fragile genome sequence

Journal Title: IOSR Journals (IOSR Journal of Computer Engineering) - Year 2016, Vol 18, Issue 5

Abstract

Abstract : Most frequently occurring recurrent chromosomal translocation allied with all subtype of leukemia are available in Mitel Mann Data base. We have retrieved about 55 such genome sequence from TIC dB database with 100% similarity score and got noncoding sequence of chromosome 9 and 22 as positive example of fragile site. Another 55 housekeeping genome sequence is taken for classification purpose. For content based analysis we have extracted 20 features of frequency density of mono nucleotide and dinucleotide. The network is designed by determining hyper parameters like number of hidden layer, hidden neurons and input features. Firstwe took 20 input features and there after 16 for reducing number of free parameters (i.e. weight space). Network is also pruned for succeeding experiments. The training strategy was also exhaustively explored, basedon literature study and trial and error heuristic methods to achieve more and more accuracy. Regularization is also employed by cross validation and early stopping. We have achieved 95% accuracy for training data and 70% to test data in first experiment. To avoid this over fitting at last we could achieve 93% over all accuracy and outlier detection, too. We could be able to show that dinucleotide frequency density is important statistical feature for classifying genome sequence. This classifier can show the probability of fragility to occur in genome sequence at very early stage so as to deal with the diesis at prognosis phase.

Authors and Affiliations

Medha Patel , Dr. Devarshi Mehta , Dr. Patrick Patterson , Dr. Rakesh Rawal

Keywords

Related Articles

  Corporate Policy Governance in Secure MD5 DataChanges and Multi Hand Administration

Abstract: Policy based management is an administrative approach that simplify the management of a givenendeavor by establishing policies to deal with situation that are likely to occur. Most of the social network andmobi...

 Traffic Dynamics in Virtual Routing Multi Topology System

 Providing a better performance is the key in IP network systems.An Adaptive Multipath Routing(AMR) system is introduced to handle the unpredicted traffic dynamics. The proposed system consists of Weight Computa...

 A review on Visualization Approaches of Data mining in heavyspatial databases

 Abstract: Data mining is the phenomenon to extract and recognized the new required pattern or types from thelarge data seta or data bases and whatever required data is being extracted and separated from large datab...

To Propose Improvement in Probability based object tracking technique for Multiple Object Tracking

Abstract: The object tracking is the technique which is used to track object from the image or from the video. The video consists of multiple frames and in each frame location of that object had been predicted. To predic...

 Emotion Recognition using combination of MFCC and LPCCwith Supply Vector Machine

Abstract: Speech is a medium through which emotions are expressed by human being. In this paper, a mixtureof MFCC and LPCC has been proposed for audio feature extraction. One of the greatest advantage of MFCC isthat it i...

Download PDF file
  • EP ID EP133682
  • DOI -
  • Views 79
  • Downloads 0

How To Cite

Medha Patel, Dr. Devarshi Mehta, Dr. Patrick Patterson, Dr. Rakesh Rawal (2016). Applying Back Propagation Algorithm for classification of fragile genome sequence. IOSR Journals (IOSR Journal of Computer Engineering), 18(5), 1-10. https://europub.co.uk/articles/-A-133682