Performa Comparison of the K-Means Method for Classification in Diabetes Patients Using Two Normalization Methods

Abstract

The diabetes classification system is very useful in the health sector. This paper discusses the classification system for diabetes using the K-Means algorithm. The Pima Indian Diabetes (PID) dataset is used to train and evaluate this algorithm. The unbalanced value range in the attributes affects the quality of the classification result, so it is necessary to preprocess the data which is expected to improve the accuracy of the PID dataset classification result. Two types of preprocessing methods are used that are min-max normalization and z-score normalization. These two normalization methods are used and the classification accuracies are compared. Before the data classification process is carried out, the data is divided into training data and test data. The result of the classification test using the K-Means algorithm has shown that the best accuracy lies in the PID dataset which has been normalized using the min-max normalization method, which 79% compared to z-score normalization.

Authors and Affiliations

Dwianti Westari, Dr. Abdul Halim,

Keywords

Related Articles

Optimization Design of Reducting Co & HC Gas through Alloy Converter Catalyst Prototype Model

Technological developments have an impact on increasing the number of motorized vehicles such as motorcycles, cars, and other modes of transportation. This causes air pollution impacts such as gas emissions from fossil f...

Measuring Company Value with Intervening Profitability Variables in Companies Listed on the Indonesia Stock Exchange's LQ-45 Index for the 2018–2021 Period

This study aims to analyze the effect of company size, capital structure and dividend policy on company value with profitability as an intervening variable in LQ-45 Companies Listed on the Indonesia Stock Exchange for th...

Strengthening Prevention of Negativeness from Social Networks to the Political Identity of Vietnamese Students

The majority of Vietnamese students today belong to the Gen Z generation (born after 1995), they are the generation of global citizens of the digital age, capable of changing the world and determining politics, economics...

The Impact of Belief, Attitude and Subjective Norm on OCOP Products Purchase Intention of Vietnamese Consumers

We do research about the impact of belief, attitude and subjective norm on OCOP (one community one product) products purchase intention of Vietnamese consumers. OCOP (abbreviated in English as One commune one product). I...

Knowledge and Problems Encountered During Teenage Pregnancy in Afgoi District, Somalia: A Descriptive Cross- Sectional Study

Pregnancy in a female under the age of 19 is referred to as teenage pregnancy or adolescent pregnancy. Due to the detrimental effects on the prospective bride and her offspring, teenage pregnancy has been referred to as...

Download PDF file
  • EP ID EP691984
  • DOI 10.47191/ijmra/v4-i1-03
  • Views 225
  • Downloads 0

How To Cite

Dwianti Westari, Dr. Abdul Halim, (2021). Performa Comparison of the K-Means Method for Classification in Diabetes Patients Using Two Normalization Methods. International Journal of Multidisciplinary Research and Analysis, 4(03), -. https://europub.co.uk/articles/-A-691984