Applying Back Propagation Algorithm for classification of fragile genome sequence

Journal Title: IOSR Journals (IOSR Journal of Computer Engineering) - Year 2016, Vol 18, Issue 5

Abstract

The most frequently occurring recurrent chromosomal translocations associated with all subtypes of leukemia are available in the Mitelman Database. We retrieved about 55 such genome sequences from the TICdb database with a 100% similarity score, obtaining noncoding sequences of chromosomes 9 and 22 as positive examples of fragile sites. Another 55 housekeeping genome sequences were taken for classification purposes. For content-based analysis we extracted 20 features: the frequency densities of mononucleotides and dinucleotides. The network was designed by determining hyperparameters such as the number of hidden layers, hidden neurons, and input features. We first used 20 input features and thereafter 16, to reduce the number of free parameters (i.e., the weight space). The network was also pruned for subsequent experiments. The training strategy was explored exhaustively, based on a literature study and trial-and-error heuristics, to improve accuracy. Regularization was applied through cross-validation and early stopping. In the first experiment we achieved 95% accuracy on the training data and 70% on the test data. After addressing this overfitting, we achieved 93% overall accuracy, along with outlier detection. We were able to show that dinucleotide frequency density is an important statistical feature for classifying genome sequences. This classifier can indicate the probability of fragility occurring in a genome sequence at a very early stage, so that the disease can be dealt with at the prognosis phase.
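The feature extraction described in the abstract can be illustrated with a minimal sketch. It is not the authors' code: the function names are hypothetical, and two assumptions are made. First, the 20 features are read as 4 mononucleotide plus 16 dinucleotide frequency densities, which matches the count but is not spelled out in the abstract. Second, the reduced 16-feature input is taken to be the dinucleotide densities alone. Overlapping windows are used, and windows containing ambiguous bases such as N are skipped, which is also an assumption.

```python
from itertools import product

BASES = "ACGT"


def frequency_density(seq, k):
    """Relative frequency of every k-mer over overlapping windows of seq.

    The result is ordered lexicographically over A, C, G, T, so k=1 gives
    4 values and k=2 gives 16 values.
    """
    seq = seq.upper()
    kmers = ["".join(p) for p in product(BASES, repeat=k)]
    counts = dict.fromkeys(kmers, 0)
    total = 0
    for i in range(len(seq) - k + 1):
        window = seq[i:i + k]
        if window in counts:  # assumption: skip windows with ambiguous bases
            counts[window] += 1
            total += 1
    # Avoid division by zero for sequences with no valid window.
    return [counts[m] / total if total else 0.0 for m in kmers]


def extract_features(seq, use_mono=True):
    """Build the input vector: 20 features, or 16 when use_mono is False.

    Assumption: the 16-feature variant keeps only dinucleotide densities.
    """
    features = frequency_density(seq, 2)
    if use_mono:
        features = frequency_density(seq, 1) + features
    return features
```

Calling `extract_features(sequence)` on each of the fragile-site and housekeeping sequences would yield the rows of an input matrix for a back-propagation network; passing `use_mono=False` gives the smaller input layer with fewer weights to train.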

Authors and Affiliations

Medha Patel, Dr. Devarshi Mehta, Dr. Patrick Patterson, Dr. Rakesh Rawal

How To Cite

Medha Patel, Dr. Devarshi Mehta, Dr. Patrick Patterson, Dr. Rakesh Rawal (2016). Applying Back Propagation Algorithm for classification of fragile genome sequence. IOSR Journals (IOSR Journal of Computer Engineering), 18(5), 1-10. https://europub.co.uk/articles/-A-133682