Applying Back Propagation Algorithm for classification of fragile genome sequence
Journal Title: IOSR Journals (IOSR Journal of Computer Engineering) - Year 2016, Vol 18, Issue 5
Abstract
Abstract : Most frequently occurring recurrent chromosomal translocation allied with all subtype of leukemia are available in Mitel Mann Data base. We have retrieved about 55 such genome sequence from TIC dB database with 100% similarity score and got noncoding sequence of chromosome 9 and 22 as positive example of fragile site. Another 55 housekeeping genome sequence is taken for classification purpose. For content based analysis we have extracted 20 features of frequency density of mono nucleotide and dinucleotide. The network is designed by determining hyper parameters like number of hidden layer, hidden neurons and input features. Firstwe took 20 input features and there after 16 for reducing number of free parameters (i.e. weight space). Network is also pruned for succeeding experiments. The training strategy was also exhaustively explored, basedon literature study and trial and error heuristic methods to achieve more and more accuracy. Regularization is also employed by cross validation and early stopping. We have achieved 95% accuracy for training data and 70% to test data in first experiment. To avoid this over fitting at last we could achieve 93% over all accuracy and outlier detection, too. We could be able to show that dinucleotide frequency density is important statistical feature for classifying genome sequence. This classifier can show the probability of fragility to occur in genome sequence at very early stage so as to deal with the diesis at prognosis phase.
Authors and Affiliations
Medha Patel , Dr. Devarshi Mehta , Dr. Patrick Patterson , Dr. Rakesh Rawal
More General Sophisticated Method of Implementation of Fiber to the Homes
Fiber to the Homes (FTTH) is one of the most important fiber optic applications, since FTTH provides huge bandwidth. The single fiber offering multi services such as :( Data, Voice, Video etc.).Comparing FTTH and c...
Risk Factor for Periodontitis
Periodontal disease possesses a significant challenge to the patient and the oral health care professional equally .Risk factor that are associated with periodontal disease must be properly identified and examin...
Renal Calculi Detection in Ultrasound images and Diagnosis of Images using Image Segmentation
Abstract: Now-a-days Renal Calculi is becoming a most common disease in both men and women. Calculi are due to abnormal collection of certain chemicals like oxalate, phosphate and uric acid. These calculi can be pr...
Object Removal Using Super-Resolution-Based In-Painting
Abstract: In-painting is the process of reconstructing lost or deteriorated part of images based on the background information. Image in-painting fills the missing or damaged region in an image, utilizing information of...
Enhancement of Network Administration through Software Defined Networks
Abstract: Now a days organizing the Network is very complex and challenging issue. To control, manage, and to provide a secure communication network, network managers must grapple with low-level vendor-specific configura...