A Diabetic Disease Prediction Model Based on Classification Algorithms

Journal Title: Annals of Emerging Technologies in Computing - Year 2019, Vol 3, Issue 3

Abstract

Diabetes is a chronic disease that afflicts 246 million people worldwide and, according to a World Health Organisation (WHO) report, this figure will rise to 380 million by 2025. Many other debilitating and critical health issues may develop if the disease is not diagnosed or remains unidentified. Machine Learning (ML) techniques are now used in various fields such as education, healthcare, business and recommendation systems. Healthcare data is complex, high-dimensional and contains irrelevant information, which lowers prediction accuracy. This research used the Pima Indians Diabetes Dataset, which consists of 768 records. First, the missing values are replaced by the median, followed by Linear Discriminant Analysis. Using the Python programming language, feature selection techniques are applied in combination with five classification algorithms: Support Vector Machine (SVM), Multi-Layer Perceptron (MLP), Logistic Regression, Random Forest and Decision Tree. The aim of this paper is to compare these classification algorithms in order to predict diabetes in patients more accurately. K-fold cross-validation is applied with k set to 2, 4, 5 and 10. The performance measures are accuracy, precision, recall, F1 score and area under the curve. Our study found that the MLP classifier gave the highest accuracy of 78.7%, with a recall of 61.26%, a precision of 72.45% and an F1 score of 65.97%, for k = 4.
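
The workflow outlined in the abstract (median imputation, Linear Discriminant Analysis, five classifiers, k-fold cross-validation with k = 2, 4, 5 and 10) could be sketched roughly as follows. This is an illustrative sketch, not the authors' code. It assumes scikit-learn and NumPy, which the abstract does not name; it uses a synthetic 768-record stand-in with NaN-marked missing values rather than the real dataset; it uses stratified, shuffled folds and default hyperparameters; and it does not reproduce the paper's feature selection step. The names `classifiers`, `scoring` and `evaluate` are hypothetical.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
from sklearn.ensemble import RandomForestClassifier
from sklearn.impute import SimpleImputer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import StratifiedKFold, cross_validate
from sklearn.neural_network import MLPClassifier
from sklearn.pipeline import make_pipeline
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier

# Synthetic stand-in for the 768-record dataset (assumption: 8 numeric features).
X, y = make_classification(n_samples=768, n_features=8, n_informative=5,
                           n_redundant=0, random_state=0)

# Assumption: missing values are marked as NaN; blank out roughly 5% of entries.
rng = np.random.default_rng(0)
X[rng.random(X.shape) < 0.05] = np.nan

# The five classifier families named in the abstract, mostly with default settings.
classifiers = {
    "SVM": SVC(random_state=0),
    "MLP": MLPClassifier(max_iter=500, random_state=0),
    "Logistic Regression": LogisticRegression(max_iter=1000),
    "Random Forest": RandomForestClassifier(random_state=0),
    "Decision Tree": DecisionTreeClassifier(random_state=0),
}

# Accuracy, precision, recall, F1 score and area under the ROC curve.
scoring = ["accuracy", "precision", "recall", "f1", "roc_auc"]


def evaluate(X, y, k_values=(2, 4, 5, 10)):
    """Return mean cross-validated scores keyed by (classifier name, k)."""
    results = {}
    for name, clf in classifiers.items():
        # Imputation and LDA sit inside the pipeline, so both are refit on
        # the training part of every fold and never see the held-out part.
        pipe = make_pipeline(
            SimpleImputer(strategy="median"),
            LinearDiscriminantAnalysis(),
            clf,
        )
        for k in k_values:
            cv = StratifiedKFold(n_splits=k, shuffle=True, random_state=0)
            scores = cross_validate(pipe, X, y, cv=cv, scoring=scoring)
            results[(name, k)] = {m: scores[f"test_{m}"].mean() for m in scoring}
    return results


results = evaluate(X, y)
for (name, k), metrics in results.items():
    print(f"{name:20s} k={k:2d} accuracy={metrics['accuracy']:.3f} f1={metrics['f1']:.3f}")
```

Scores from this sketch come from synthetic data and are not comparable to the figures reported above. Keeping the preprocessing inside the pipeline is a design choice of the sketch; the abstract does not say whether the authors fitted these steps per fold or once on the full dataset.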

Authors and Affiliations

Ravinder Ahuja, Subhash C. Sharma, Maaruf Ali

  • EP ID EP594478
  • DOI 10.33166/AETiC.2019.03.005

How To Cite

Ravinder Ahuja, Subhash C. Sharma, Maaruf Ali (2019). A Diabetic Disease Prediction Model Based on Classification Algorithms. Annals of Emerging Technologies in Computing, 3(3), 44-52. https://europub.co.uk/articles/-A-594478