An Analysis of Gene Expression Variations in Lymphoma, Using a Fuzzy Classification Model
Journal Title: Journal of Health Management and Informatics - Year 2017, Vol 4, Issue 1
Abstract
Introduction: Cancer is a major cause of mortality in the modern world, and one of the most important health problems in societies. During recent years, research on cancer as a system biology disease is focused on molecular differences between cancer cells and healthy cells. Most of the proposed methods for classifying cancer using gene expression data act as black boxes and lack biological interpretability. The goal of this study is to design an interpretable fuzzy model for classifying gene expression data of Lymphoma cancer. Method: In this research, the investigated microarray contained 45 samples of lymphoma. Total number of genes was 4026 samples. At first, we offer a hybrid approach to reduce the data dimension for detecting genes involved in lymphoma cancer. In lymphoma microarray, six out of 4029 genes were selected. Then, a fuzzy interpretable classifier was presented for classification of data. Fuzzy inference was performed using two rules which had the highest scores. Weka3.6.9 software was used to reduce the features and the fuzzy classifier model was implemented in MATLAB R2010a. Results of this study were assessed by two measures of accuracy and precision. Results: In pre-processing stage, in order to classify gene expression data of Lymphoma, six out of 4026 genes were identified as cancer-causing genes, and then the fuzzy classifier model was applied on the obtained data. The accuracy of the results of classification was 96 percent using 10 rules with the highest scores and that using 2 rules with the highest scores was about 98 percent. Conclusion: In the proposed approach, for the first time, a fully fuzzy method named a minimal rule fuzzy classification (MRFC) was introduced for extracting fuzzy rules with biological interpretability and meaning extraction from gene expression data. Among the most outstanding features of this method is the ability of extracting a small set of rules to interpret effective gene expression in cancer patients. Another result of this approach is successfully addressing the problem of disproportion between the number of samples and genes in microarrays with the proposed Filter-Wrapper Feature Selection method (FWFS).
Authors and Affiliations
Zahra Roozbahani, Jalal Rezaeenour, Mansoureh Yari Eili, Ali Katanforoush
An Analysis of Gene Expression Variations in Lymphoma, Using a Fuzzy Classification Model
Introduction: Cancer is a major cause of mortality in the modern world, and one of the most important health problems in societies. During recent years, research on cancer as a system biology disease is focused on molecu...
An Overview of the Current State and Prospects of Development of e-Health in Uzbekistan
Introduction: A significant role is played by the automation of diagnostic and treatment process, and the implementation of information and communication technologies, medical information systems, telemedicine, electroni...
Spatial Assessment of Accessibility to Public Healthcare Services: A Case Study on Accessibility to Hospitals in Shiraz
Introduction: Unfair distribution of healthcare services is one of the most important issues all over the world. The present study aimed to determine the distribution pattern of available hospital beds and the accessibil...
Founder’s Syndrome and Firm Performance of Small and Medium Scale Enterprises in Nigeria
Introduction: Founder’s syndromes have become a significant issue in SMEs performance. This study examined the impact of founder’s syndrome on firm performance of small and medium scale enterprises in Nigeria. Method: Th...
The Eight Principles of Medical Librarians in the Health Systems
The access to health information for all people is the fundamental goal of the World Health Organization (WHO). It comes true when both health professionals and patients as beneficiaries have access to health information...