Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: Journal of Medical Internet Research

Date Submitted: Apr 1, 2025
Open Peer Review Period: Apr 1, 2025 - May 27, 2025
Date Accepted: Apr 30, 2025
Date Submitted to PubMed: Apr 30, 2025
(closed for review but you can still tweet)

The final, peer-reviewed published version of this preprint can be found here:

Comparison of Multimodal Deep Learning Approaches for Predicting Clinical Deterioration in Ward Patients: Observational Cohort Study

Kotula CA, Martin J, Carey KA, Edelson DP, Dligach D, Mayampurath A, Afshar M, Churpek MM

Comparison of Multimodal Deep Learning Approaches for Predicting Clinical Deterioration in Ward Patients: Observational Cohort Study

J Med Internet Res 2025;27:e75340

DOI: 10.2196/75340

PMID: 40499139

PMCID: 12176310

Comparison of Multimodal Deep Learning Approaches for Predicting Clinical Deterioration in Ward Patients: An Observational Cohort Study

  • Charles A Kotula; 
  • Jennie Martin; 
  • Kyle A Carey; 
  • Dana P Edelson; 
  • Dmitriy Dligach; 
  • Anoop Mayampurath; 
  • Majid Afshar; 
  • Matthew M Churpek

ABSTRACT

Background:

Implementing machine learning models to identify clinical deterioration on the wards is associated with decreased morbidity and mortality. However, these models have high false positive rates and only use structured data.

Objective:

We aim to compare models with and without information from clinical notes for predicting deterioration.

Methods:

Adults admitted to the wards at the University of Chicago (development cohort) and University of Wisconsin-Madison (external validation cohort) were included. Predictors consisted of structured and unstructured variables extracted from notes as Concept Unique Identifiers (CUIs). We parameterized CUIs in five ways: Standard Tokenization (ST), ICD Rollup using Tokenization (ICDR-T), ICD Rollup using Binary Variables (ICDR-BV), CUIs as SapBERT Embeddings (SE), and CUI Clustering using SapBERT embeddings (CC). Each parameterization method combined with structured data and structured data-only were compared for predicting intensive care unit transfer or death in the next 24 hours using deep recurrent neural networks.

Results:

The development (UC) cohort included 284,302 patients, while the external validation (UW) cohort included 248,055. In total, 4.9% (N=26,281) of patients experienced the outcome. The SE model achieved the highest AUPRC (0.208), followed by CC (0.199) and the structured-only model (0.199), ICDR-BV (0.194), ICDR-T (0.166), and ST (0.158). The CC and structured-only models achieved the highest AUROC (0.870), followed by ICDR-T (0.867), ICDR-BV (0.866), ST (0.860), and SE (0.859). In terms of sensitivity and positive predictive value, the CC model achieved the greatest positive predictive value (12.53%) and sensitivity (52.15%) at the cutoff that flagged 5% of the observations in the test set. At the 15% cutoff, the ICDR-T, CC, and ICDR-BV models tied for the highest positive predictive value at 5.67%, while their sensitivities were 70.95%, 70.92%, and 70.86%, respectively. All models were well calibrated, achieving Brier scores in the range of 0.011-0.012. The modified IG method revealed that CUIs corresponding to terms such as “NPO – Nothing by mouth”, “Chemotherapy”, “Transplanted tissue”, and “Dialysis procedure” were most predictive of deterioration.

Conclusions:

A multimodal model combining structured data with embeddings using SapBERT had the highest AUPRC, but performance was similar between models with and without CUIs. Although the addition of CUIs from notes to structured data did not meaningfully improve model performance for predicting clinical deterioration, models using CUIs could provide clinicians with relevant information and additional clinical context for supporting decision-making.


 Citation

Please cite as:

Kotula CA, Martin J, Carey KA, Edelson DP, Dligach D, Mayampurath A, Afshar M, Churpek MM

Comparison of Multimodal Deep Learning Approaches for Predicting Clinical Deterioration in Ward Patients: Observational Cohort Study

J Med Internet Res 2025;27:e75340

DOI: 10.2196/75340

PMID: 40499139

PMCID: 12176310

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.