Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR Medical Informatics

Date Submitted: Dec 28, 2020
Date Accepted: Jan 31, 2022

The final, peer-reviewed published version of this preprint can be found here:

Electronic Medical Record–Based Machine Learning Approach to Predict the Risk of 30-Day Adverse Cardiac Events After Invasive Coronary Treatment: Machine Learning Model Development and Validation

Kwon O, Na W, Kang HJ, Jun TJ, Kweon J, Park GM, Cho Y, Hur C, Chae J, Kang DY, Lee PH, Ahn JM, Park DW, Kang SJ, Lee CW, Lee SW, Park SW, Park SJ, Yang DH, Kim YH

Electronic Medical Record–Based Machine Learning Approach to Predict the Risk of 30-Day Adverse Cardiac Events After Invasive Coronary Treatment: Machine Learning Model Development and Validation

JMIR Med Inform 2022;10(5):e26801

DOI: 10.2196/26801

PMID: 35544292

PMCID: 9133980

Electronic Medical Record-Based Machine Learning Approach to Predict the Risk of 30-Day Adverse Cardiac Events after Invasive Coronary Treatment

  • Osung Kwon; 
  • Wonjun Na; 
  • Hee Jun Kang; 
  • Tae Joon Jun; 
  • Jihoon Kweon; 
  • Gyung-Min Park; 
  • YongHyun Cho; 
  • Cinyoung Hur; 
  • Jungwoo Chae; 
  • Do-Yoon Kang; 
  • Pil Hyung Lee; 
  • Jung-Min Ahn; 
  • Duk-Woo Park; 
  • Soo-Jin Kang; 
  • Cheol Whan Lee; 
  • Seung-Whan Lee; 
  • Seong-Wook Park; 
  • Seung-Jung Park; 
  • Dong Hyun Yang; 
  • Young-Hak Kim

ABSTRACT

Background:

Although there is a growing interest in prediction models based on electronic medical record (EMR), to identify patients at risk of adverse cardiac events following invasive coronary treatment, robust models fully utilizing EMR data are limited.

Objective:

We aimed to develop and validate machine-learning (ML) models using diverse fields of EMR to predict risk of 30-day adverse cardiac events after percutaneous intervention or bypass surgery.

Methods:

EMR data of 5,184,565 records of 16,793 patients at a quaternary hospital between 2006-2016, was categorized into static basic (e.g. demographics), dynamic time-series (e.g. laboratory values), and cardiac-specific data (e.g. coronary angiography). The data were randomly split into training, tuning, and testing sets in a ratio of 3:1:1. Each model was evaluated with 5-fold cross-validation and with an external EMR-based cohort at a tertiary hospital. Logistic regression (LR), random forest (RF), gradient boosting machine (GBM), and feedforward neural network (FNN) algorithms were applied. Primary outcome was 30-day mortality following invasive treatment.

Results:

GBM showed the best performance with area under the receiver operating characteristic curve (AUROC) of 0.99; RF had a similar AUROC of 0.98. AUROCs of FNN and LR were 0.96 and 0.93, respectively. GBM had the highest area under the precision-recall curve (AUPRC) of 0.80 and those of RF, LR and FNN were 0.73, 0.68, and 0.63, respectively. All models showed low Brier scores of <0.1 as well as highly fitted calibration plots, indicating a good fit of the ML-based models. On external validation, the GBM model demonstrated maximal performance with AUROCs 0.90, while FNN had AUROC of 0.85. The AUROC of LR and RF were slightly lower at 0.80, and 0.79, respectively. The AUPRCs of GBM, LR, and FNN were similar at 0.47, 0.43, and 0.41, respectively, while that of RF was lower at 0.33. All models showed low Brier scores of 0.1. Among the categories in the GBM model, time-series dynamic data demonstrated high AUROC of >0.95, contributing majorly to the excellent result

Conclusions:

Exploiting diverse fields of EMR dataset, the ML-based 30-days adverse cardiac event prediction models performed outstanding, and the applied framework could be generalized for various healthcare prediction models.ts.


 Citation

Please cite as:

Kwon O, Na W, Kang HJ, Jun TJ, Kweon J, Park GM, Cho Y, Hur C, Chae J, Kang DY, Lee PH, Ahn JM, Park DW, Kang SJ, Lee CW, Lee SW, Park SW, Park SJ, Yang DH, Kim YH

Electronic Medical Record–Based Machine Learning Approach to Predict the Risk of 30-Day Adverse Cardiac Events After Invasive Coronary Treatment: Machine Learning Model Development and Validation

JMIR Med Inform 2022;10(5):e26801

DOI: 10.2196/26801

PMID: 35544292

PMCID: 9133980

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.