Accepted for/Published in: JMIR Medical Informatics
Date Submitted: Jan 6, 2022
Open Peer Review Period: Jan 5, 2022 - Mar 2, 2022
Date Accepted: May 2, 2022
(closed for review but you can still tweet)
Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.
Multi-Task Learning with Recurrent Neural Networks for ARDS Prediction using only EHR Data: Model Development and Validation Study
ABSTRACT
Background:
Acute Respiratory Distress Syndrome (ARDS) is a condition that is often considered to have broad and subjective diagnostic criteria and is associated with significant mortality and morbidity. Early and accurate prediction of ARDS and related conditions such as hypoxemia and sepsis could allow timely administration of therapies, leading to improved patient outcomes.
Objective:
To perform an exploration of how multi-label classification in the clinical setting can take advantage of the underlying dependencies between ARDS and related conditions to improve early prediction of ARDS.
Methods:
The electronic health record dataset included 40,073 patient encounters from 7 hospitals from 4/20/2018 to 3/17/2021. A recurrent neural network (RNN) was trained using data from 5 hospitals, and external validation was conducted on data from 2 hospitals. In addition to ARDS, 12 target labels for related conditions such as sepsis, hypoxemia and Covid-19 were used to train the model to classify a total of 13 outputs. As a comparator, XGBoost models were developed for each of the 13 target labels. Model performance was assessed using the area under the receiver operating characteristic (AUROC). Heatmaps to visualize attention scores were generated to provide interpretability to the NNs. Finally, cluster analysis was performed to identify potential phenotypic subgroups of ARDS patients.
Results:
The single RNN model trained to classify 13 outputs outperformed the XGBoost model for ARDS prediction, achieving an AUROC of 0.842 on the external test sets. Models trained on an increasing number of tasks resulted in increasing performance. Earlier diagnosis of ARDS nearly doubled the rate of in-hospital survival. Cluster analysis revealed distinct ARDS subgroups, some of which had similar mortality rates but different clinical presentations.
Conclusions:
The RNN model presented in this paper can be used as an early warning system to stratify patients who are at risk of developing one of the multiple risk outcomes, hence providing practitioners with means to take early action.
Citation
Request queued. Please wait while the file is being generated. It may take some time.
Copyright
© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.