Feature Selection and Extraction Framework for DNA Methylation in Cancer

Abstract

Feature selection methods for cancer classification are aimed to overcome the high dimensionality of the biomedical data which is a challenging task. Most of the feature selection methods based on DNA methylation are time consuming during testing phase to identify the best pertinent features subset that are relevant to accurate prediction. However, the hybridization between feature selection and extraction methods will bring a method that is far fast than only feature selection method. This paper proposes a framework based on both novel feature selection methods that employ statistical variation, standard deviation and entropy, along with extraction methods to predict cancer using three new features, namely, Hypomethylation, Midmethylation and Hypermethylation. These new features represent the average methylation density of the corresponding three regions. The three features are extracted from the selected features based on the analysis of the methylation behavior. The effectiveness of the proposed framework is evaluated by the breast cancer classification accuracy. The results give 98.85% accuracy using only three features out of 485,577 features. This result proves the capability of the proposed approach for breast cancer diagnosis and confirms that feature selection and extraction methods are critical for practical implementation.

Authors and Affiliations

Abeer A. Raweh, Mohammad Nassef, Amr Badr

Keywords

Related Articles

Distributed Optimization Model of Wavelet Neuron for Human Iris Verification

Automatic human iris verification is an active research area with numerous applications in security purposes. Unfortunately, most of feature extraction methods in human iris verification systems are sensitive to noise, s...

A Proposed Model of Cloud based e-Learning for Najran University

For the time being, the educational institutions are keen to use e-learning in their educational environment. This, in turn, will support their learning process and allow the learners to access any service or learning ma...

Load Balancing based on Bee Colony Algorithm with Partitioning of Public Clouds

Cloud computing is an emerging trend in the IT industry that provides new opportunities to control costs associated with the creation and maintenance of applications. Of prevalent issues in cloud computing, load balancin...

Interface of an Automatic Recognition System for Dysarthric Speech

This paper addresses the realization of a Human/Machine (H/M) interface including a system for automatic recognition of the Continuous Pathological Speech (ARSCPS) and several communication tools in order to help frail p...

A Generic Adaptive Multi-Gene-Set Genetic Algorithm (AMGA)

Genetic algorithms have been used extensively in solving complex solution-space search problems. However, certain problems can include multiple sub-problems in which multiple searches through distinct solution-spaces are...

Download PDF file
  • EP ID EP259990
  • DOI 10.14569/IJACSA.2017.080705
  • Views 99
  • Downloads 0

How To Cite

Abeer A. Raweh, Mohammad Nassef, Amr Badr (2017). Feature Selection and Extraction Framework for DNA Methylation in Cancer. International Journal of Advanced Computer Science & Applications, 8(7), 30-36. https://europub.co.uk/articles/-A-259990