Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: Journal of Medical Internet Research

Date Submitted: Nov 4, 2019
Date Accepted: Jul 7, 2020

The final, peer-reviewed published version of this preprint can be found here:

Machine Learning Model for Risk Prediction of Community-Acquired Acute Kidney Injury Hospitalization From Electronic Health Records: Development and Validation Study

Hsu CN, Liu CL, Tain YL, Kuo CY, Lin YC

Machine Learning Model for Risk Prediction of Community-Acquired Acute Kidney Injury Hospitalization From Electronic Health Records: Development and Validation Study

J Med Internet Res 2020;22(8):e16903

DOI: 10.2196/16903

PMID: 32749223

PMCID: 7435690

Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.

Machine learning for risk prediction of community-acquired acute kidney injury hospitalization: Development and validation of a prediction risk score using electronic health records

  • Chien-Ning Hsu; 
  • Chien-Liang Liu; 
  • You-Lin Tain; 
  • Chin-Yu Kuo; 
  • Yun-Chun Lin

ABSTRACT

Background:

Community-acquired acute kidney injury (CA-AKI)-associated hospitalizations impose significant health care needs and in-hospital mortality. However, most risk predictions have focused on AKI in a specific group of patients during hospitalization; there is limited knowledge on the baseline risk in the general population for preventing CA-AKI-associated hospitalization.

Objective:

To gain further insight into risk exploration, this study aimed to develop, validate, and set a scoring system to facilitate health professionals in enabling early recognition and intervention to prevent permanent kidney damage using different machine learning techniques.

Methods:

A nested case-control study design was employed using electronic health records derived from a group of Chang Gung Memorial Hospitals in Taiwan from 2010-2017 to identify 234,867 adults with at least two measures of serum creatinine at hospital admissions. Patients were classified into derivation cohort (2010-2016) and temporal validation cohort (2017). Patients with first episode of CA-AKI at hospital admission were classified into the case group and those without CA-AKI in the control group. A total of 47 potential candidate variables, including age, gender, prior use of nephrotoxic medications, Charlson comorbid conditions, commonly measured laboratory results, and recent use of health services were tested to develop a CA-AKI hospitalization risk model. Permutation-based selection with extreme gradient boost (XGBoost) and least absolute shrinkage and selection operator (LASSO) algorithms both determined the same top 10 important features.

Results:

The discriminative ability of the risk model was assessed by the area under the receiver operating characteristic curve (AUC), and the predictive CA-AKI risk model derived by logistic regression algorithm achieved an AUC of 0.767 (95% confidence interval [CI] 0.764–0.770) on derivation and of 0.761 on validation for any stage of AKI with 19.2% and 96.1% of the positive and negative predictive values. The risk model for the prediction of CA-AKI stages 2 and 3 had 0.818 of AUC on validation cohort with 13.3% and 98.4% of the positive and negative predictive values. The metrics above were evaluated at the cut-off value of 7.993, which was a threshold to discriminate the risk of AKI.

Conclusions:

Machine learning-generated risk score model can identify patients at risk of developing CA-AKI related hospitalization through a routine care data driven approach. The validated multivariate risk assessment tool could help clinicians to stratify patients in primary care, and to provide monitoring and early intervention to prevent AKI and improve the quality of AKI care in the general population. Clinical Trial: NA


 Citation

Please cite as:

Hsu CN, Liu CL, Tain YL, Kuo CY, Lin YC

Machine Learning Model for Risk Prediction of Community-Acquired Acute Kidney Injury Hospitalization From Electronic Health Records: Development and Validation Study

J Med Internet Res 2020;22(8):e16903

DOI: 10.2196/16903

PMID: 32749223

PMCID: 7435690

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.