JMIR Preprints #37689: Identification of the risk of sepsis in cancer patients using digital health care records

Current Preprint Settings

(as selected by the authors)

1. When the manuscript is submitted, allow peer review from:

(a) Anybody (open community peer review)
(b) Editor-selected reviewers (closed peer review)

2. When the manuscript is submitted, display the preprint PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

3. When the manuscript is accepted, display the accepted manuscript PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

Identification of the risk of sepsis in cancer patients using digital health care records

Donghun Yang;
Jimin Kim;
Junsang Yoo;
Won Chul Cha;
Hyojung Paik

ABSTRACT

Background:

Sepsis is diagnosed in millions of people every year, resulting in high mortality rate. Although sepsis patients present multimorbid conditions, including cancers, sepsis predictions have mainly focused on patients with severe injuries.

Objective:

Here, we present a machine learning-based approach to identify sepsis risk in cancer patients using electronic health records (EHRs).

Methods:

We utilized anonymized EHRs from the Samsung Medical Center in Korea, including 8,580 cancer patient records in longitudinal manner (2014~2019). To build a prediction model based on physical status that would differ between sepsis and nonsepsis patients, we analyzed 2,462 laboratory (lab) test results and 2,266 medication prescriptions using a graph network and statistical analysis. Based on the results, the model was trained with lab tests and medication relationships.

Results:

Sepsis patients showed differential medication trajectories and physical status. For example, in the network-based analysis, narcotic analgesics were prescribed more often in the sepsis group, along with other drugs. Likewise, 35 types of lab test, including albumin, globulin, and prothrombin time, showed significantly different distributions between sepsis and nonsepsis patients (p value<0.0001). Our model outperformed the model trained using only common EHRs, showing improved accuracy, area under the receiver operating characteristic (AUROC), and F1-scores up to 11.9%, 11.3%, and 13.6%, respectively. (Accuracy: 0.692, AUROC: 0.753, and F1-score: 0.602 for the random forest-based model).

Conclusions:

We elucidated that lab tests and medication relationships can be used as efficient features for predicting sepsis in cancer patients. Consequently, identification of sepsis risk in cancer patients using EHRs and machine learning is feasible.

Citation

Please cite as:

Yang D, Kim J, Yoo J, Cha WC, Paik H

Identifying the Risk of Sepsis in Patients With Cancer Using Digital Health Care Records: Machine Learning–Based Approach

JMIR Med Inform 2022;10(6):e37689

DOI: 10.2196/37689

PMID: 35704364

PMCID: 9244654

Download PDF

Request queued. Please wait while the file is being generated. It may take some time.

Copyright

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.

JMIR Publications

JMIR Preprints

Accepted for/Published in: JMIR Medical Informatics

Date Submitted: Mar 3, 2022

Date Accepted: May 17, 2022

Identification of the risk of sepsis in cancer patients using digital health care records

ABSTRACT

Citation

Copyright