Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR Medical Informatics

Date Submitted: Mar 3, 2022
Date Accepted: May 17, 2022

The final, peer-reviewed published version of this preprint can be found here:

Identifying the Risk of Sepsis in Patients With Cancer Using Digital Health Care Records: Machine Learning–Based Approach

Yang D, Kim J, Yoo J, Cha WC, Paik H

Identifying the Risk of Sepsis in Patients With Cancer Using Digital Health Care Records: Machine Learning–Based Approach

JMIR Med Inform 2022;10(6):e37689

DOI: 10.2196/37689

PMID: 35704364

PMCID: 9244654

Identification of the risk of sepsis in cancer patients using digital health care records

  • Donghun Yang; 
  • Jimin Kim; 
  • Junsang Yoo; 
  • Won Chul Cha; 
  • Hyojung Paik

ABSTRACT

Background:

Sepsis is diagnosed in millions of people every year, resulting in high mortality rate. Although sepsis patients present multimorbid conditions, including cancers, sepsis predictions have mainly focused on patients with severe injuries.

Objective:

Here, we present a machine learning-based approach to identify sepsis risk in cancer patients using electronic health records (EHRs).

Methods:

We utilized anonymized EHRs from the Samsung Medical Center in Korea, including 8,580 cancer patient records in longitudinal manner (2014~2019). To build a prediction model based on physical status that would differ between sepsis and nonsepsis patients, we analyzed 2,462 laboratory (lab) test results and 2,266 medication prescriptions using a graph network and statistical analysis. Based on the results, the model was trained with lab tests and medication relationships.

Results:

Sepsis patients showed differential medication trajectories and physical status. For example, in the network-based analysis, narcotic analgesics were prescribed more often in the sepsis group, along with other drugs. Likewise, 35 types of lab test, including albumin, globulin, and prothrombin time, showed significantly different distributions between sepsis and nonsepsis patients (p value<0.0001). Our model outperformed the model trained using only common EHRs, showing improved accuracy, area under the receiver operating characteristic (AUROC), and F1-scores up to 11.9%, 11.3%, and 13.6%, respectively. (Accuracy: 0.692, AUROC: 0.753, and F1-score: 0.602 for the random forest-based model).

Conclusions:

We elucidated that lab tests and medication relationships can be used as efficient features for predicting sepsis in cancer patients. Consequently, identification of sepsis risk in cancer patients using EHRs and machine learning is feasible.


 Citation

Please cite as:

Yang D, Kim J, Yoo J, Cha WC, Paik H

Identifying the Risk of Sepsis in Patients With Cancer Using Digital Health Care Records: Machine Learning–Based Approach

JMIR Med Inform 2022;10(6):e37689

DOI: 10.2196/37689

PMID: 35704364

PMCID: 9244654

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.