Identifying Risk Factors for Heart Failure: A Case Study Employing Data Mining Algorithms

Journal Title: Journal of Data Science and Intelligent Systems - Year 2024, Vol 2, Issue 3

Abstract

Heart diseases are increasingly present in the lives of human beings and are diseases that affect the heart and blood vessels and can lead the person who develops to death. In this article, we analyzed an open and public database on heart failure composed of a sample of 299 people and 12 attributes. This article presents a preprocessing technique using area under the curve (AUC) filters, which increases the efficiency of the algorithms by decreasing the parameters, leading to better memory usage and computational processing. To enhance our results, we used a methodology involving 102 simultaneous validations. This approach allowed us to obtain more robust and reliable results. In addition, we used the receiver operating characteristic curve to evaluate the overall performance of each attribute. We trained a set of nine classification algorithms, among which the random forest learner stood out with an accuracy of 87.21% when using a filter that considered attributes with AUC greater than 0.4, considering values of AUC. Additionally, the fuzzy rules learner demonstrated its effectiveness by achieving an accuracy of 84.45% with a filter limit of 0.6, focusing on ejection fraction, serum sodium, time attributes, and class for death events. This analysis demonstrated the ability of these algorithms to effectively use a reduced number of attributes for accurate predictions.

Authors and Affiliations

Vitória S. Souza, Danielli A. Lima

Keywords

Related Articles

Applications of Quantum Computing in Health Sector

The purpose of this paper is to provide an overview of the current state of quantum computing in the health sector and to explore its potential future applications. Quantum computing has the potential to revolutionize a...

3D-STCNN: Spatiotemporal Convolutional Neural Network Based on EEG 3D Features for Detecting Driving Fatigue

Fatigue driving has become one of the main causes of traffic accidents, and driving fatigue detection based on electroencephalogram (EEG) can effectively evaluate the driver's mental state and avoid the occurrence of tra...

Symmetric Kernel-Based Approach for Elliptic Partial Differential Equation

In this work, two globally supported and positive definite radial kernels: generalized inverse multiquadric and linear Laguerre Gaussian radial kernels were used to construct symmetric kernel-based interpolating scheme u...

A Study of the Effects of the Shape Parameter and Type of Data Points Locations on the Accuracy of the Hermite-Based Symmetric Approach Using Positive Definite Radial Kernels

Theoretical approximation ideas served as the driving force behind the research. one can see that the shape parameter's behavior is driven by the kind of problem and the analytical standards that are applied. the primary...

Efficient Scheduling of Data Transfers in Multi-tiered Storage

Multi-tiered persistent storage systems integrate many types of persistent storage devices, such as different types of NVMes, SSDs, and HDDs. This integration provides a multi-level view of persistent storage, where each...

Download PDF file
  • EP ID EP752188
  • DOI 10.47852/bonviewJDSIS32021386
  • Views 22
  • Downloads 0

How To Cite

Vitória S. Souza, Danielli A. Lima (2024). Identifying Risk Factors for Heart Failure: A Case Study Employing Data Mining Algorithms. Journal of Data Science and Intelligent Systems, 2(3), -. https://europub.co.uk/articles/-A-752188