Comparison of Severity of Illness Scores and Artificial Intelligence Models That Are Predictive of Intensive Care Unit Mortality: Meta-analysis and Review of the Literature

Predictive Models of Intensive Care Unit Mortality - Severity of Illness Scores or Artificial Intelligence instruments? - Literature Review and Metanalysis

Cristina Barboi;
Andreas Tzavelis;
Lutfiyya NaQiyba Muhammad

ABSTRACT

Background:

The Severity of Illness Scores (SIS), namely the Acute Physiology and Chronic Health Evaluation (APACHE), the Simplified Acute Physiology Score (SAPS), and the Sequential Organ Failure Assessment (SOFA), are the current risk stratification and mortality prediction tools used in intensive care units (ICUs) across the globe. They rely on scores that assess disease severity on admission. Developers of Artificial Intelligence (AI) or Machine Learning (ML) models predictive of ICU mortality use SIS performance as a reference point when reporting the performance of these computational constructs.

Objective:

Using a systematic review and meta-analysis, we evaluated studies that compare ML-based mortality prediction models with SIS-based models. The review aims to inform clinicians about the prognostic value of ML-based ICU mortality prediction models relative to SIS models and about their validity in supporting clinical decision-making.

Methods:

We performed a systematic search of the PubMed, Scopus, Embase, and IEEE databases. Studies that report the performance of newly developed ML models predictive of ICU mortality and compare it with the performance of SIS models on the same datasets were eligible for inclusion. ML and SIS models with a reported Area Under the Receiver Operating Characteristic (AUROC) curve were included in the meta-analysis to identify the group with superior performance. Data were extracted with guidance from the CHARMS (Critical Appraisal and Data Extraction for Systematic Reviews of Prediction Modelling Studies) checklist [1] and were appraised for risk of bias and applicability using PROBAST (A Tool to Assess the Risk of Bias and Applicability of Prediction Model Studies) [2].
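
To make the comparison metric concrete, the following is a minimal sketch of how an AUROC can be computed for a single mortality model. It is not taken from any of the reviewed studies. The function name `auroc` and the toy labels and scores are hypothetical, chosen only for illustration. The sketch uses the pairwise interpretation of the AUROC: the probability that a randomly chosen non-survivor receives a higher predicted risk than a randomly chosen survivor, with ties counted as one half.

```python
def auroc(labels, scores):
    """Pairwise (Mann-Whitney) estimate of the AUROC.

    labels: 1 = died in the ICU, 0 = survived (hypothetical coding).
    scores: predicted mortality risk or severity score, higher = sicker.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUROC needs at least one case from each class")

    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0   # non-survivor correctly ranked above survivor
            elif p == n:
                wins += 0.5   # ties count as half a correct ranking
    return wins / (len(pos) * len(neg))


# Illustrative data only; not from the reviewed studies.
toy_labels = [1, 1, 0, 0, 0, 1]
toy_scores = [0.90, 0.40, 0.35, 0.80, 0.10, 0.70]
print(round(auroc(toy_labels, toy_scores), 3))
```

Applying the same function to an ML model's predicted risks and to an SIS score on the same patients would give the paired AUROC values that eligible studies report. The quadratic loop is adequate for a sketch; a rank-based implementation would be preferable on large datasets.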

Results:

After screening the literature, we identified and included 20 papers containing 47 ML models based on seven types of algorithms that were compared with three types of SIS models. The AUROC for predicting ICU mortality ranged from 0.828 to 0.875 for ML-based models and from 0.707 to 0.760 for SIS-based models. We noted substantial heterogeneity among the models reported and considerable variation among the AUROC estimates for both ML and SIS model types. Because of the high degree of heterogeneity, we performed a limited random-effects meta-analysis of externally validated subgroups of ML models and of the subgroups of SIS used for comparison.
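
As an illustration of what a random-effects pooling of AUROC estimates involves, here is a minimal sketch. The abstract does not state which estimator or scale was used, so the DerSimonian-Laird estimator and pooling on the raw AUROC scale are assumptions. The function name `pool_random_effects` and the input values are hypothetical and are not data from the review.

```python
import math


def pool_random_effects(estimates, std_errors):
    """DerSimonian-Laird random-effects pooling (assumed estimator).

    estimates:  per-study AUROC values.
    std_errors: their standard errors.
    Returns the pooled estimate, its standard error, tau^2, and I^2 (%).
    """
    if len(estimates) != len(std_errors) or len(estimates) < 2:
        raise ValueError("need two or more studies with matching inputs")

    # Fixed-effect (inverse-variance) weights and pooled mean.
    w = [1.0 / se ** 2 for se in std_errors]
    sw = sum(w)
    fixed = sum(wi * yi for wi, yi in zip(w, estimates)) / sw

    # Cochran's Q measures between-study dispersion.
    q = sum(wi * (yi - fixed) ** 2 for wi, yi in zip(w, estimates))
    df = len(estimates) - 1

    # Between-study variance tau^2, truncated at zero.
    c = sw - sum(wi ** 2 for wi in w) / sw
    tau2 = max(0.0, (q - df) / c)

    # I^2: share of total variation attributed to heterogeneity.
    i2 = max(0.0, (q - df) / q) * 100.0 if q > 0 else 0.0

    # Random-effects weights add tau^2 to each within-study variance.
    w_re = [1.0 / (se ** 2 + tau2) for se in std_errors]
    pooled = sum(wi * yi for wi, yi in zip(w_re, estimates)) / sum(w_re)
    se_pooled = math.sqrt(1.0 / sum(w_re))
    return {"pooled": pooled, "se": se_pooled, "tau2": tau2, "i2": i2}


# Illustrative inputs only; not values extracted in the review.
result = pool_random_effects([0.80, 0.85, 0.90], [0.02, 0.02, 0.02])
low = result["pooled"] - 1.96 * result["se"]
high = result["pooled"] + 1.96 * result["se"]
print(result, (low, high))
```

A large I^2 in such an analysis signals that much of the spread in AUROC values reflects genuine differences between studies rather than sampling error, which is why pooled results under high heterogeneity call for cautious interpretation. Many analyses pool on the logit scale instead, because the AUROC is bounded between 0 and 1.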

Conclusions:

ML-based models can accurately predict ICU mortality as an alternative to traditional scoring models. The high degree of heterogeneity observed within and between studies limits the assessment of pooled results. Differences in the development strategies, validation approaches, and statistical and computational methods that these models rely on impede a head-to-head comparison, and we cannot declare the superiority of one model over the other. Consequently, we make no recommendation regarding the performance of ML-based ICU mortality prediction models in clinical practice. To bridge the knowledge gap from design to practice, ML model developers must provide explainer models and make those knowledge objects reproducible, interoperable, and transparent [3].

Clinical Trial:

The review was registered with and approved by the international prospective register of systematic reviews, PROSPERO (reference number CRD42021203871).

Citation

Please cite as:

Barboi C, Tzavelis A, Muhammad LN

Comparison of Severity of Illness Scores and Artificial Intelligence Models That Are Predictive of Intensive Care Unit Mortality: Meta-analysis and Review of the Literature

JMIR Med Inform 2022;10(5):e35293

DOI: 10.2196/35293

PMID: 35639445

PMCID: 9198821

Copyright

© The authors. All rights reserved. This is a privileged document currently under peer review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on its website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a CC BY license on publication, at this stage authors and publisher expressly prohibit redistribution of this draft paper other than for review purposes.

JMIR Publications

JMIR Preprints

Accepted for/Published in: JMIR Medical Informatics

Date Submitted: Nov 29, 2021

Date Accepted: Apr 25, 2022
