Mortality Prediction based on Imbalanced New Born and Perinatal Period Data

Abstract

This study was carried out by the New York State Department of Health between 2012 and 2016. It evaluates six supervised machine learning methods for predicting infant mortality: Support Vector Machine (SVM), Logistic Regression (LR), Gradient Boosting (GB), Random Forest (RF), Deep Learning (DL) and an ensemble model. The ensemble model assigns different weights to different models per output class in order to obtain better predictive performance for infant mortality. The performance of each model was measured and the accuracy of the classifiers was compared. Several criteria, including the area under the ROC curve, were considered when comparing the ensemble model (GB, RF and DL) with the other five models (SVM, LR, DL, GB and RF). In terms of these criteria, the ensemble model outperformed the others in predicting survival among infant patients. On the balanced data set, the areas under the ROC curve for the minor, moderate, major and extreme output classes were 98%, 95%, 92% and 97% respectively, with a total accuracy of 80.65%. On the imbalanced data set, the areas under the ROC curve for the same classes were 98%, 98%, 99% and 99% respectively, and total accuracy increased to 97.44%. The results of the experiments in this dissertation showed that the ensemble model provided a better level of prediction for infant mortality than the other five models, based on the relative prediction accuracy of each model for each output class. The ensemble model is therefore an extremely promising classifier for predicting infant mortality.
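The per-class weighting described in the abstract can be illustrated with a minimal sketch. The abstract does not say how the weighted outputs are combined, so this example assumes weighted soft voting: each model's predicted probability for a class is multiplied by that model's weight for that class, and the class with the highest combined score wins. The function name `weighted_class_ensemble`, the probabilities and the weights are all hypothetical values chosen for illustration, not figures from the paper.

```python
# Minimal sketch of per-class weighted soft voting (an assumed combination
# rule; the paper's exact scheme is not given in the abstract).

CLASSES = ["minor", "moderate", "major", "extreme"]


def weighted_class_ensemble(probas, weights):
    """Combine model outputs using one weight per (model, class) pair.

    probas[m][s][c]: probability model m assigns to class c for sample s.
    weights[m][c]:   weight of model m for class c (hypothetical values).
    Returns (labels, scores): predicted class indices and combined scores.
    """
    n_models = len(probas)
    n_samples = len(probas[0])
    n_classes = len(probas[0][0])

    # Normalise so that, for each class, the model weights sum to 1.
    totals = [sum(weights[m][c] for m in range(n_models))
              for c in range(n_classes)]

    scores = []
    for s in range(n_samples):
        row = [
            sum(probas[m][s][c] * weights[m][c] / totals[c]
                for m in range(n_models))
            for c in range(n_classes)
        ]
        scores.append(row)

    # Predicted class = index of the highest combined score per sample.
    labels = [max(range(n_classes), key=row.__getitem__) for row in scores]
    return labels, scores


# Toy probabilities for two samples from three models (GB, RF, DL).
probas = [
    [[0.6, 0.2, 0.1, 0.1], [0.1, 0.2, 0.3, 0.4]],  # GB
    [[0.5, 0.3, 0.1, 0.1], [0.1, 0.1, 0.5, 0.3]],  # RF
    [[0.2, 0.5, 0.2, 0.1], [0.1, 0.1, 0.2, 0.6]],  # DL
]

# Hypothetical per-class weights: rows are models, columns are classes.
weights = [
    [0.5, 0.3, 0.2, 0.2],  # GB
    [0.3, 0.3, 0.5, 0.2],  # RF
    [0.2, 0.4, 0.3, 0.6],  # DL
]

labels, scores = weighted_class_ensemble(probas, weights)
print([CLASSES[i] for i in labels])  # ['minor', 'extreme']
```

In practice the weights would plausibly be derived from each model's validation performance on each class, which is consistent with the abstract's reference to "the relative prediction accuracy for each model for each output class", although the exact derivation is not stated there.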

Authors and Affiliations

Wafa M. AlShwaish, Maali Ibr. Alabdulhafith

DOI: 10.14569/IJACSA.2019.0100808

How To Cite

Wafa M. AlShwaish, Maali Ibr. Alabdulhafith (2019). Mortality Prediction based on Imbalanced New Born and Perinatal Period Data. International Journal of Advanced Computer Science & Applications, 10(8), 51-60. https://europub.co.uk/articles/-A-626542