A Diabetic Disease Prediction Model Based on Classification Algorithms
Journal Title: Annals of Emerging Technologies in Computing - Year 2019, Vol 3, Issue 3
Abstract
Diabetes is a chronic disease that afflicts 246 million people worldwide and, according to a World Health Organisation (WHO) report, this figure will rise to 380 million by 2025. Many other debilitating and critical health issues may develop if the disease is not diagnosed or remains unidentified. Machine Learning (ML) techniques are now used in fields such as education, healthcare, business and recommendation systems. Healthcare data is complex, high-dimensional and contains irrelevant information, which lowers prediction accuracy. This research uses the Pima Indians Diabetes Dataset, which consists of 768 records. First, missing values are replaced by the median, and Linear Discriminant Analysis is then applied. Using the Python programming language, feature selection techniques are applied in combination with five classification algorithms: Support Vector Machine (SVM), Multi-Layer Perceptron (MLP), Logistic Regression, Random Forest and Decision Tree. The aim of this paper is to compare these classification algorithms in order to predict diabetes in patients more accurately. K-fold cross-validation is applied with k set to 2, 4, 5 and 10. The performance measures are accuracy, precision, recall, F1 score and area under the curve. Our study found that the MLP classifier gave the highest accuracy of 78.7%, with a recall of 61.26%, precision of 72.45% and F1 score of 65.97%, for k = 4.
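The evaluation steps named in the abstract (median imputation, k-fold splitting and the threshold-based performance measures) can be illustrated with a minimal standard-library Python sketch. It is not the authors' code: the function names, the use of `None` as the missing-value marker and the contiguous, unshuffled folds are all assumptions made for illustration, and the classifiers, Linear Discriminant Analysis and area under the curve are left out.

```python
from statistics import median


def impute_median(column, missing=None):
    """Replace missing entries with the median of the observed entries.

    Assumption: missing values are marked with `None`; the abstract does
    not say how they are encoded in the dataset.
    """
    observed = [v for v in column if v is not missing]
    fill = median(observed)
    return [fill if v is missing else v for v in column]


def k_fold_indices(n_samples, k):
    """Yield (train_idx, test_idx) pairs for k contiguous folds.

    Assumption: no shuffling or stratification; the abstract does not
    specify either.
    """
    base, extra = divmod(n_samples, k)
    start = 0
    for fold in range(k):
        # The first `extra` folds take one additional sample.
        size = base + (1 if fold < extra else 0)
        test_idx = list(range(start, start + size))
        train_idx = list(range(0, start)) + list(range(start + size, n_samples))
        yield train_idx, test_idx
        start += size


def binary_metrics(y_true, y_pred):
    """Accuracy, precision, recall and F1 for labels in {0, 1}."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    return {
        "accuracy": (tp + tn) / len(pairs),
        "precision": precision,
        "recall": recall,
        "f1": f1,
    }


if __name__ == "__main__":
    # Test-fold sizes for a 768-record dataset at each k used in the study.
    for k in (2, 4, 5, 10):
        sizes = [len(test) for _, test in k_fold_indices(768, k)]
        print(k, sizes)
```

In a full experiment, each classifier would be fitted on the training indices of every fold and scored on the matching test indices, with the per-fold metrics then averaged.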
Authors and Affiliations
Ravinder Ahuja, Subhash C. Sharma, Maaruf Ali