Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: Journal of Medical Internet Research

Date Submitted: Jul 29, 2020
Date Accepted: Feb 1, 2021
Date Submitted to PubMed: Feb 3, 2021

The final, peer-reviewed published version of this preprint can be found here:

Learning From Past Respiratory Infections to Predict COVID-19 Outcomes: Retrospective Study

Sang S, Sun R, Coquet J, Carmichael H, Seto T, Hernandez-Boussard T

Learning From Past Respiratory Infections to Predict COVID-19 Outcomes: Retrospective Study

J Med Internet Res 2021;23(2):e23026

DOI: 10.2196/23026

PMID: 33534724

PMCID: 7901593

Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.

Learning from past respiratory infections to predict COVID-19 Outcomes: A retrospective study

  • Shengtian Sang; 
  • Ran Sun; 
  • Jean Coquet; 
  • Haris Carmichael; 
  • Tina Seto; 
  • Tina Hernandez-Boussard

ABSTRACT

Background:

In the clinical care of well-established diseases, randomized trials, literature and research are supplemented by clinical judgment to understand disease prognosis and inform treatment choices. In the void created by a lack of clinical experience with COVID-19, Artificial Intelligence (AI) may be an important tool to bolster clinical judgment and decision making. However, lack of clinical data restricts the design and development of such AI tools, particularly in preparation of an impending crisis or pandemic.

Objective:

This study aimed to develop and test the feasibility of a ‘patients-like-me’ framework to predict COVID-19 patient deterioration using a retrospective cohort of similar respiratory diseases.

Methods:

Our framework used COVID-like cohorts to design and train AI models that were then validated on the COVID-19 population. The COVID-like cohorts included patients diagnosed with bacterial pneumonia, viral pneumonia, unspecified pneumonia, influenza, and acute respiratory distress syndrome (ARDS) from an academic medical center, 2008-2019. Fifteen training cohorts were created using different combinations of the COVID-like cohorts with the ARDS cohort for exploratory purpose. Two machine learning (ML) models were developed, one to predict invasive mechanical ventilation (IMV) within 48 hours for each hospitalized day, and one to predict all-cause mortality at the time of admission. Model performance was assessed using the area under the receiver operating characteristic curve (AUC), sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV). We established model interpretability by calculating SHapley Additive exPlanations (SHAP) scores to identify important features.

Results:

Compared to the COVID-like cohorts (n=16,509), the COVID-19 hospitalized patients (n=125) were significantly younger, with a higher proportion of Hispanic ethnicity, lower proportion of smoking history and fewer comorbidities (P <0.001). COVID-19 patients had a lower IMV rate (15.1 vs 23.2, P=0.016) and shorter time to IMV (2.9 vs 4.1, P <0.001) compared to the COVID-like patients. In the COVID-like training data, the top models achieved excellent performance (AUV > 0.90). Validating in the COVID-19 cohort, the best performing model of predicting IMV was the XGBoost model (AUC: 0.831) trained on the viral pneumonia cohort. Similarly, the XGBoost model trained on all four COVID-like cohorts without ARDS achieved the best performance (AUC: 0.928) in predicting mortality. Important predictors included demographic information (age), vital signs (oxygen saturation), and laboratory values (white blood count, cardiac troponin, albumin, etc.). Our models suffered from class imbalance, that resulted in high negative predictive values and low positive predictive values.

Conclusions:

We provided a feasible framework for modeling patient deterioration using existing data and AI technology to address data limitations during the onset of a novel, rapidly changing pandemic.


 Citation

Please cite as:

Sang S, Sun R, Coquet J, Carmichael H, Seto T, Hernandez-Boussard T

Learning From Past Respiratory Infections to Predict COVID-19 Outcomes: Retrospective Study

J Med Internet Res 2021;23(2):e23026

DOI: 10.2196/23026

PMID: 33534724

PMCID: 7901593

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.