Mortality Prediction based on Imbalanced New Born and Perinatal Period Data

Abstract

This study was carried out by the New York State Department of Health, between 2012 and 2016. This experiment relates to six supervised machine learning methods: Support Vector Machine (SVM), Logistic Regression (LR), Gradient Boosting (GB), Random Forest (RF), Deep Learning (DL) and the Ensemble Model, all of which are used in the prediction of infant mortality. This experiment applied ensemble model that concentrated on assigning different weights to different models per output class in order to obtain a better predictive performance for infant mortality. Efforts were made to measure the performance and compare the classifier accuracy of each model. Several criteria, including the area under ROC curve, were considered when comparing the ensemble model (GB, RF and DL) with the other five models (SVM, LR, DL, GB and RF). In terms of these different criteria, the ensemble model outperformed the others in predicting survival rates among infant patients given a balanced data set (the areas under the ROC curve for minor, moderate, major and extreme were 98%, 95%, 92% and 97% respectively, giving a total accuracy of 80.65%). For the imbalanced dataset, (the areas under the ROC curve for minor, moderate, major and extreme were 98%, 98%, 99% and 99% respectively, giving total accuracy increased to 97.44%). The results of the experiments used in this dissertation showed that using the ensemble model provided a better level of prediction for infant mortality than the other five models, based on the relative prediction accuracy for each model for each output class. Therefore, the ensemble model provides and extremely promises classifier in terms of predicting infant mortality.

Authors and Affiliations

Wafa M. AlShwaish, Maali Ibr. Alabdulhafith

Keywords

Related Articles

Analysis of Spatially Modelled High Temperature Polymer Electrolyte Membrane Fuel Cell under Dynamic Load Conditions

This paper presents an interesting approach to observe the effects of the load variations on the performance of high temperature polymer electrolyte membrane fuel cell system, such as: hydrogen and air flow rate, output...

Algorithm for Enhancing the QoS of Video Traffic over Wireless Mesh Networks

One of the major issues in a wireless mesh networks (WMNs) which needs to be solved is the lack of a viable protocol for medium access control (MAC). In fact, the main concern is to expand the application of limited wire...

Lexicon-based Bot-aware Public Emotion Mining and Sentiment Analysis of the Nigerian 2019 Presidential Election on Twitter

Online social networks have been widely engaged as rich potential platforms to predict election outcomes’ in several countries of the world. The vast amount of readily-available data on such platforms, coupled with the e...

Workplace Design and Employee’s Performance and Health in Software Industry of Pakistan

Factors like colour, light, air quality, environmental conditions, and noise has a great effect on the Health and Performance of office employees. All these factors have the impact on employee’s performance and are the r...

Comparative Analysis of Evolutionary Algorithms for Multi-Objective Travelling Salesman Problem

The Evolutionary Computation has grown much in last few years. Inspired by biological evolution, this field is used to solve NP-hard optimization problems to come up with best solution. TSP is most popular and complex pr...

Download PDF file
  • EP ID EP626542
  • DOI 10.14569/IJACSA.2019.0100808
  • Views 91
  • Downloads 0

How To Cite

Wafa M. AlShwaish, Maali Ibr. Alabdulhafith (2019). Mortality Prediction based on Imbalanced New Born and Perinatal Period Data. International Journal of Advanced Computer Science & Applications, 10(8), 51-60. https://europub.co.uk/articles/-A-626542