Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR Medical Informatics

Date Submitted: May 23, 2020
Date Accepted: Aug 16, 2020

The final, peer-reviewed published version of this preprint can be found here:

Prognostic Machine Learning Models for First-Year Mortality in Incident Hemodialysis Patients: Development and Validation Study

Sheng K, Yao X, Li J, He Y, Zhang P, Chen J

Prognostic Machine Learning Models for First-Year Mortality in Incident Hemodialysis Patients: Development and Validation Study

JMIR Med Inform 2020;8(10):e20578

DOI: 10.2196/20578

PMID: 33118948

PMCID: 7661257

Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.

Machine learning for the prediction of first-year mortality in incident hemodialysis patients: machine learning study

  • Kaixiang Sheng; 
  • Xi Yao; 
  • Jiawei Li; 
  • Yongchun He; 
  • Ping Zhang; 
  • Jianghua Chen

ABSTRACT

Background:

The first-year survival among hemodialysis (HD) patients remains poor. The current mortality risk scores for HD patients employ regression techniques and have limited applicability and robustness.

Objective:

We aimed to develop a machine learning model utilizing clinical factors to predict first-year mortality in HD patients that could assist physicians in classifying high-risk patients.

Methods:

Training and testing cohorts consisted of 5351 incident HD patients from a single center and 5828 incident HD patients from 97 renal centers, respectively. The outcome was all-cause mortality during the first year of dialysis. eXtreme Gradient Boosting (XGBoost) was used for algorithm training and validation. Two models were established based on the data obtained at dialysis initiation (model 1) and data from 0-3 months after dialysis initiation (model 2). Ten-fold cross-validation was applied to each model. The C statistic (area under the receiver operating characteristic curve, AUC) was used to assess the predictive ability of the models.

Results:

In the training and testing cohorts, 585 (10.93%) and 764 (13.11%) patients, respectively, died during the first-year follow-up. Fifteen most important features were selected from 42 candidate features. The performance of model 1 (C statistic of 0.8318, 95% CI 0.7771 - 0.8374) was similar to that of model 2 (C statistic of 0.8530, 95% CI 0.8057 - 0.8593).

Conclusions:

We developed and validated two machine learning models to predict first-year mortality in HD patients. Both of the two models could be used to stratify high-risk patients at the early stages of dialysis.


 Citation

Please cite as:

Sheng K, Yao X, Li J, He Y, Zhang P, Chen J

Prognostic Machine Learning Models for First-Year Mortality in Incident Hemodialysis Patients: Development and Validation Study

JMIR Med Inform 2020;8(10):e20578

DOI: 10.2196/20578

PMID: 33118948

PMCID: 7661257

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.