Classifying Lung Adenocarcinoma and Squamous Cell Carcinoma using RNA-Seq Data

Journal Title: Cancer Studies & Molecular Medicine – Open Journal - Year 2017, Vol 3, Issue 2

Abstract

Background: Lung adenocarcinoma (LUAD) and lung squamous cell carcinoma (LUSC) are two primary subtypes of non-small cell lung carcinoma (NSCLC). Currently, the most widely used method to discriminate between LUAD and LUSC is hematoxylin-eosin (HE) staining. However, this method sometimes is unable to make the precise diagnosis on LUAD or LUSC. More accurate diagnostic approaches are highly desired. Methods: We propose to use gene expression profile to discriminate NSCLC patient’s subtype. We leveraged RNA-Seq data from The Cancer Genome Atlas (TCGA) and randomly split the data into training and testing subsets. To construct classifiers based on the training data, we considered three methods: logistic regression on principal components (PCR), logistic regression with LASSO shrinkage (LASSO), and kth nearest neighbors (KNN). Performances of classifiers were evaluated and compared based on the testing data. Results: All gene expression-based classifiers show high accuracy in discriminating LUSC and LUAD. The classifier obtained by LASSO has the smallest overall misclassification rate of 3.42% (95% CI: 3.25%-3.60%) when using 0.5 as the cutoff value for the predicted probability of belonging to a subtype, followed by classifiers obtained by PCR (4.36%, 95% CI: 4.23%- 4.49%) and KNN (8.70%, 95% CI: 8.57%-8.83%). The LASSO classifier also has the highest average area under the receiver operating characteristic curve (AUC) value of 0.993, compared to PCR (0.987) and KNN (0.965). Conclusions: Our results suggest that mRNA expressions are highly informative for classifying NSCLC subtypes and may potentially be used to assist clinical diagnosis.

Authors and Affiliations

Chi Wang

Keywords

Related Articles

Role of Molecular Imaging in Oncology

Molecular Imaging (MI) is an emerging technology for the early detection of disease, staging of the disease, and for monitoring response to therapy. It also offers a non-invasive method to detect in vivo biological funct...

Is it Time to Start Using Mitochondrial DNA Copy Number as an Indicator of Health and Diseases?

Clinical biochemistry and pathology have contributed too many assays for diagnosis and prognosis of human health and diseases. Bedside biochemistry has revolutionized modern medicine and the invention of new generation b...

Immuno-oncology: Is it a new hope for cancer patients?

Cancer is the one of the leading causes of death, whose incidences is increasing day by day due to lack of understanding about its complete mechanism. Therefore, to understand complete mechanism of cancer, researchers st...

Current Status of Anti Epidermal Growth Factor Receptor Therapy in the Curative Treatment of Head and Neck Squamous Cell Carcinoma

Squamous cell carcinoma of head and neck is the most common malignancy of the upper aero digestive tract in the world. In this article, we attempt to summarize the role of antiepidermal growth factor therapy (EGFR) in th...

A Case of Choroidal Metastasis from Small-Cell Lung Carcinoma

A 74-year-old man visited to our hospital due to severe visual disturbance in his left eye for recent one month. Fundoscopy demonstrated the serous retinal detachment at the nose side of the left eye (Figure 1A, arrows).

Download PDF file
  • EP ID EP551937
  • DOI 10.17140/CSMMOJ-3-120
  • Views 113
  • Downloads 0

How To Cite

Chi Wang (2017). Classifying Lung Adenocarcinoma and Squamous Cell Carcinoma using RNA-Seq Data. Cancer Studies & Molecular Medicine – Open Journal, 3(2), 27-31. https://europub.co.uk/articles/-A-551937