Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR Medical Informatics

Date Submitted: Sep 25, 2020
Date Accepted: Jan 27, 2021
Date Submitted to PubMed: Feb 3, 2021

The final, peer-reviewed published version of this preprint can be found here:

Development and Validation of a Machine Learning Approach for Automated Severity Assessment of COVID-19 Based on Clinical and Imaging Data: Retrospective Study

Quiroz JC, Feng YZ, Cheng ZY, Rezazadegan D, Chen PK, Lin QT, Qian L, Liu XF, Berkovsky S, Coiera E, Song L, Qiu XM, Liu S, Cai XR

Development and Validation of a Machine Learning Approach for Automated Severity Assessment of COVID-19 Based on Clinical and Imaging Data: Retrospective Study

JMIR Med Inform 2021;9(2):e24572

DOI: 10.2196/24572

PMID: 33534723

PMCID: 7879715

Automated Severity Assessment of COVID-19 based on Clinical and Imaging Data: Algorithm Development and Validation

  • Juan Carlos Quiroz; 
  • You-Zhen Feng; 
  • Zhong-Yuan Cheng; 
  • Dana Rezazadegan; 
  • Ping-Kang Chen; 
  • Qi-Ting Lin; 
  • Long Qian; 
  • Xiao-Fang Liu; 
  • Shlomo Berkovsky; 
  • Enrico Coiera; 
  • Lei Song; 
  • Xiao-Ming Qiu; 
  • Sidong Liu; 
  • Xiang-Ran Cai

ABSTRACT

Background:

Coronavirus disease 2019 (COVID-19) has overwhelmed health systems worldwide. It is important to identify severe cases as early as possible, so that resources can be mobilized and treatment can be escalated.

Objective:

This study aims to develop a machine learning approach for automated severity assessment of COVID-19 patients based on clinical and imaging data.

Methods:

Clinical data—demographics, signs, symptoms, comorbidities and blood test results—and chest CT scans of 346 patients from two hospitals in the Hubei province, China, were used to develop machine learning models for automated severity assessment of diagnosed COVID-19 cases. We compared the predictive power of clinical and imaging data by testing multiple machine learning models, and further explored the use of four oversampling methods to address the imbalance distribution issue. Features with the highest predictive power were identified using the SHAP framework.

Results:

Imaging features had the strongest impact on the model output, while a combination of clinical and imaging features yielded the best performance overall. The identified predictive features were consistent with findings from previous studies. Oversampling yielded mixed results, although it achieved the best model performance in our study. Targeting differentiation between mild and severe cases, logistic regression models achieved the best performance on clinical features (AUC:0.848, sensitivity:0.455, specificity:0.906), imaging features (AUC:0.926, sensitivity:0.818, specificity:0.901) and the combined features (AUC:0.950, sensitivity:0.764, specificity:0.919). The SMOTE oversampling method further improved the performance of the combined features to AUC of 0.960 (sensitivity:0.845, specificity:0.929).

Conclusions:

This study indicates that clinical and imaging features can be used for automated severity assessment of COVID-19 patients and have the potential to assist with triaging COVID-19 patients and prioritizing care for patients at higher risk of severe cases.


 Citation

Please cite as:

Quiroz JC, Feng YZ, Cheng ZY, Rezazadegan D, Chen PK, Lin QT, Qian L, Liu XF, Berkovsky S, Coiera E, Song L, Qiu XM, Liu S, Cai XR

Development and Validation of a Machine Learning Approach for Automated Severity Assessment of COVID-19 Based on Clinical and Imaging Data: Retrospective Study

JMIR Med Inform 2021;9(2):e24572

DOI: 10.2196/24572

PMID: 33534723

PMCID: 7879715

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.