Feature Selection and Extraction Framework for DNA Methylation in Cancer

Abstract

Feature selection methods for cancer classification are aimed to overcome the high dimensionality of the biomedical data which is a challenging task. Most of the feature selection methods based on DNA methylation are time consuming during testing phase to identify the best pertinent features subset that are relevant to accurate prediction. However, the hybridization between feature selection and extraction methods will bring a method that is far fast than only feature selection method. This paper proposes a framework based on both novel feature selection methods that employ statistical variation, standard deviation and entropy, along with extraction methods to predict cancer using three new features, namely, Hypomethylation, Midmethylation and Hypermethylation. These new features represent the average methylation density of the corresponding three regions. The three features are extracted from the selected features based on the analysis of the methylation behavior. The effectiveness of the proposed framework is evaluated by the breast cancer classification accuracy. The results give 98.85% accuracy using only three features out of 485,577 features. This result proves the capability of the proposed approach for breast cancer diagnosis and confirms that feature selection and extraction methods are critical for practical implementation.

Authors and Affiliations

Abeer A. Raweh, Mohammad Nassef, Amr Badr

Keywords

Related Articles

The Performance of the Bond Graph Approach for Diagnosing Electrical Systems

The increasing complexity of automated industrial systems, the constraints of competitiveness in terms of cost of production and facility security have mobilized in the last years a large community of researchers to impr...

Intrusion Detection System with Correlation Engine and Vulnerability Assessment

The proposed Intrusion Detection System (IDS) which is implemented with modern technologies to address certain prevailing problems in existing intrusion detection systems’ is capable of giving an advanced output to the s...

 SVD Based Image Processing Applications: State of The Art, Contributions and Research Challenges

 Singular Value Decomposition (SVD) has recently emerged as a new paradigm for processing different types of images. SVD is an attractive algebraic transform for image processing applications. The paper proposes an...

 wavelet de-noising technique applied to the PLL of a GPS receiver embedded in an observation satellite

 In this paper, we study the Doppler effect on a GPS(Global Positioning System) on board of an observation satellite that receives information on a carrier wave L1 frequency 1575.42 MHz .We simulated GPS signal acqu...

Experimental Evaluation of the Virtual Environment Efficiency for Distributed Software Development

At every software design stage nowadays, there is an acute need to solve the problem of effective choice of libraries, development technologies, data exchange formats, virtual environment systems, characteristics of virt...

Download PDF file
  • EP ID EP259990
  • DOI 10.14569/IJACSA.2017.080705
  • Views 110
  • Downloads 0

How To Cite

Abeer A. Raweh, Mohammad Nassef, Amr Badr (2017). Feature Selection and Extraction Framework for DNA Methylation in Cancer. International Journal of Advanced Computer Science & Applications, 8(7), 30-36. https://europub.co.uk/articles/-A-259990