Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR Medical Informatics

Date Submitted: Jan 22, 2020
Date Accepted: Mar 7, 2021

The final, peer-reviewed published version of this preprint can be found here:

Predicting Prolonged Length of Hospital Stay for Peritoneal Dialysis–Treated Patients Using Stacked Generalization: Model Development and Validation Study

Kong G, Wu J, Chu H, Yang C, Lin Y, Lin K, Shi Y, Wang H, Zhang L

Predicting Prolonged Length of Hospital Stay for Peritoneal Dialysis–Treated Patients Using Stacked Generalization: Model Development and Validation Study

JMIR Med Inform 2021;9(5):e17886

DOI: 10.2196/17886

PMID: 34009135

PMCID: 8173398

Predicting prolonged length of hospital stay for peritoneal dialysis patients: a stacked generalization method

  • Guilan Kong; 
  • Jingyi Wu; 
  • Hong Chu; 
  • Chao Yang; 
  • Yu Lin; 
  • Ke Lin; 
  • Ying Shi; 
  • Haibo Wang; 
  • Luxia Zhang

ABSTRACT

Background:

Given the increasing number of peritoneal dialysis patients and the importance of optimal resource allocation, an effective length of hospital stay (LOS) prediction model is needed by physicians.

Objective:

We aimed to construct an LOS prediction model for peritoneal dialysis patients using stacked generalization (stacking), and compare the prediction performance of the stacking model with the benchmark logistics regression (LR) model and its three base models: support vector machine (SVM), random forest (RF) and K-nearest neighbor (KNN).

Methods:

Patient data collected using the Hospital Quality Monitoring System (HQMS) in China were used to develop and validate the models. We constructed the stacking model using three machine learning methods: SVM, RF, and KNN as its base models and LR as its meta-model. In addition, a LR model was constructed as benchmark model. The prediction performance of the ensemble model was compared with each base model and the benchmark model. Five-fold cross-validation was employed to develop and validate the models. In performance comparison, the prediction performance of each model was measured using Brier score, area under the receiver operating characteristic curve (AUROC), estimated calibration index (ECI), and geometric mean (Gm). Further, calibration plot was employed to visually demonstrate the calibration power of each model.

Results:

The final cohort in our study consisted of 22,859 eligible peritoneal dialysis patients, among which 25.2% had a prolonged LOS (pLOS). The results showed that the stacking model achieved the best overall performance (Brier score, 0.157), discrimination (AUROC, 0.756), and calibration (ECI, 7.846); the second-best balanced accuracy (Gm, 0.688). Compared with the benchmark LR model, the stacking model outperformed in terms of all performance measures.

Conclusions:

The stacking-based LOS prediction model has superior prediction performance compared to its base models and traditional LR model. The stacking model has great potential to assist physicians to do LOS prediction for peritoneal dialysis patients, and it may be helpful in assisting physicians to optimally allocate medical resources and achieve better clinical outcomes. However, a prospective study is still needed to evaluate the stacked generalization model before its use in clinical practice.


 Citation

Please cite as:

Kong G, Wu J, Chu H, Yang C, Lin Y, Lin K, Shi Y, Wang H, Zhang L

Predicting Prolonged Length of Hospital Stay for Peritoneal Dialysis–Treated Patients Using Stacked Generalization: Model Development and Validation Study

JMIR Med Inform 2021;9(5):e17886

DOI: 10.2196/17886

PMID: 34009135

PMCID: 8173398

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.