Classification and Diagnostic Prediction of Breast Cancers via Different Classifiers
Journal Title: International Scientific and Vocational Studies Journal - Year 2018, Vol 2, Issue 2
Abstract
Cancer is one of the leading causes of death worldwide and caused approximately 9.6 million deaths in 2018. Breast cancer is the leading cause of cancer death among women. However, breast cancer can be treated when it is diagnosed early. The aim of this study is to detect breast cancer at an early stage, and machine learning methods were used for diagnostic classification to that end. The samples in the Wisconsin Diagnostic Breast Cancer (WDBC) data set were classified with support vector machines (SVM), k-nearest neighbors, Naive Bayes, J48, and random forests. A preprocessing step was applied to the data set prior to classification. After preprocessing, the five classifiers were applied to the data using 10-fold cross-validation. Accuracy, sensitivity, and specificity values and confusion matrices were used to measure the success of the methods. SVM with a linear kernel was the most successful method, with a success rate of 98.24%. Despite its simplicity, the k-nearest neighbors method was the second most successful, with a success rate of 97.72%. The results obtained with feature selection show that feature selection and the other preprocessing methods increase the success of the system. The success achieved compares favorably with that of previous studies.
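The evaluation measures named in the abstract can be illustrated with a minimal Python sketch. It is not the study's code: the helper names (`ConfusionMatrix`, `confusion_matrix`, `accuracy`, `sensitivity`, `specificity`) are hypothetical, the label lists are made-up illustration data rather than results from the paper, and treating malignant ("M") as the positive class is an assumption.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConfusionMatrix:
    """Counts for a binary problem; hypothetical helper, not from the paper."""
    tp: int  # malignant samples predicted malignant
    fn: int  # malignant samples predicted benign
    tn: int  # benign samples predicted benign
    fp: int  # benign samples predicted malignant


def confusion_matrix(y_true, y_pred, positive="M"):
    """Tally the four cells, assuming `positive` marks the malignant class."""
    tp = fn = tn = fp = 0
    for actual, predicted in zip(y_true, y_pred):
        if actual == positive:
            if predicted == positive:
                tp += 1
            else:
                fn += 1
        elif predicted == positive:
            fp += 1
        else:
            tn += 1
    return ConfusionMatrix(tp=tp, fn=fn, tn=tn, fp=fp)


def accuracy(cm):
    """Share of all samples that were classified correctly."""
    return (cm.tp + cm.tn) / (cm.tp + cm.fn + cm.tn + cm.fp)


def sensitivity(cm):
    """Share of malignant samples that were detected (true positive rate)."""
    return cm.tp / (cm.tp + cm.fn)


def specificity(cm):
    """Share of benign samples that were correctly cleared (true negative rate)."""
    return cm.tn / (cm.tn + cm.fp)


# Made-up labels for illustration only: 50 malignant and 100 benign samples.
y_true = ["M"] * 50 + ["B"] * 100
y_pred = ["M"] * 40 + ["B"] * 10 + ["B"] * 95 + ["M"] * 5

cm = confusion_matrix(y_true, y_pred)
print(cm)
print(f"accuracy={accuracy(cm):.4f}",
      f"sensitivity={sensitivity(cm):.4f}",
      f"specificity={specificity(cm):.4f}")
```

In a 10-fold cross-validation setting, one plausible approach is to sum the four counts over the ten held-out folds and compute the measures once from the pooled matrix. The sketch does not guard against an empty class, which would cause a division by zero.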
Authors and Affiliations
Ahmet Saygılı