Optimization Techniques for SCAD Variable Selection in Medical Research

Abstract

High-dimensional data analysis requires variable selection to identify truly relevant variables. More often it is done implicitly via regularization, such as penalized regression. Of the many versions of penalties, SCAD has shown good properties and has been widely adopted in medical research and many more areas. This paper reviews the various optimization techniques in solving SCAD penalized regression. High-dimensional data analysis has been a common and important topic in biomedical/genomic/clinical studies. For example, the identification of genetic factors for complex diseases such as lung cancer implicates a variety of genetic variants. For high-dimensional data, there is the well-known problem of curse of dimensionality arising in modeling. Therefore, variable selection is a fundamental task for high-dimensional statistical modeling. The "old school" way of doing variable selection is to follow a subset selection procedure prior to building the model of interest. The procedure commonly adopts AIC/BIC as evaluation metric and often iterates in a stepwise fashion. Yet this is independent of the subsequent modeling task hence the effectiveness might be less desirable. A more natural way is to integrate the variable selection into the modeling itself, i.e., the penalized regression, which simultaneously performs variable selection and coefficient estimation. Theoretically, the "best" penalty for the penalized regression is the number of non-zero variables, to push as many variables to zero as possible. Yet, it is well known that the L0 (also known as the entropy penalty) optimization [1] is infeasible. As such, the L1 (LASSO) penalty Tibshirani [2] is our "next best" candidate, which is widely adopted in statistical and machine learning community for sparse solutions. However, [3] point out that L1 suffers the problem of biasedness. They propose the Smoothly Clipped Absolute Deviation (SCAD) penalty that can produce unbiased estimates while retaining good properties of L1. Subsequently, the SCAD penalty function has seen a wide range of applications including medical/clinical research, such as [1,4-7]. Nevertheless, the estimating procedure for SCAD penalized regression is no trivial task, because the target function a) is a high-dimensional non-concave function, b) is singular at the origin, c) does not have continuous second order derivatives.

Authors and Affiliations

Yan Fang, Yan Yan Kong, Yumei Jiao

Keywords

Related Articles

Differentiating Crohn’s Disease from Ulcerative Colitis - New Factors

The characteristics of inflammatory bowel diseases (IBD) are often ambiguous. The information obtained may deepen the cur-rent state of knowledge about ulcerative colitis and Crohn’s disease. For this reason, finding the...

Fine structure of Somatotrophs in Pars Distalis of the Indian wild Caught Female Bat, Taphozous nudiventris kachhensis (Dobson)

The ultra structural observations on Somatotrophs During estrus cells are spherical to oval in shape with the spherical nucleus. Cytoplasm of cell is filled with a large number of round to oval shaped secretory granules...

Multisession CyberKnife Radiosurgery for Symptomatic Abducens Nerve Palsy

Abducens nerve palsy causes diplopia which lowers patients’ quality of life. Nerve palsy due to tumor compression is treated by surgery, but Abducens nerve is difficult to access. Ten patients with benign and malignant t...

The Role of the Sciton Profile Nd-Yag Laser in the Reduction of Cutaneous Varicosities, a Pilot Study

The current pilot study reports a 19 patient case series to determine the effectiveness of the Sciton Profile Nd- YAG laser in the reduction of cutaneous vessels. Documentation was made of vessel characteristics includin...

Actuator for Nano biomedical Research

In this work, we obtain the parameters of the actuator for nano biomedical research. We have mathematical model of the actuator with the piezoelectric or magneto strictive effect.Actuator for nano biomedical research is...

Download PDF file
  • EP ID EP585942
  • DOI 10.26717/BJSTR.2018.08.001632
  • Views 126
  • Downloads 0

How To Cite

Yan Fang, Yan Yan Kong, Yumei Jiao (2018). Optimization Techniques for SCAD Variable Selection in Medical Research. Biomedical Journal of Scientific & Technical Research (BJSTR), 8(2), 6425-6426. https://europub.co.uk/articles/-A-585942