JMIR Preprints #26139: The effectiveness of using Multi-prognostic index (MPI) domain scores, clinical data and machine learning to improve 12-month mortality risk prediction in older hospitalized patients

Current Preprint Settings

(as selected by the authors)

1. When the manuscript is submitted, allow peer review from:

(a) Anybody (open community peer review)
(b) Editor-selected reviewers (closed peer review)

2. When the manuscript is submitted, display the preprint PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

3. When the manuscript is accepted, display the accepted manuscript PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

The effectiveness of using Multi-prognostic index (MPI) domain scores, clinical data and machine learning to improve 12-month mortality risk prediction in older hospitalized patients

Richard Woodman;
Kimberley Bryant;
Michael J Sorich;
Alberto Pilotto;
Arduino Aleksander Mangoni

ABSTRACT

Background:

The Multidimensional Prognostic Index (MPI) is an aggregate comprehensive geriatric assessment scoring system derived from eight domains, that predicts adverse outcomes including 12-month mortality (12MM). However, prediction accuracy, using the 3 MPI categories (mild, moderate, severe risk) as per previous investigations was relatively poor in a recent study with older hospitalized Australian patients. Prediction modelling using the component domains of the MPI together with additional clinical features and Machine Learning (ML) algorithms might improve prediction accuracy

Objective:

To assess whether prediction accuracy for 12MM using logistic regression with maximum likelihood estimation (LR-MLE) with the 3-category MPI together with age and gender (feature-set 1) can be improved with the addition of 10 clinical features (sodium, haemoglobin, albumin, creatinine, urea, urea/creatinine ratio, estimated glomerular filtration rate, C-reactive protein, body mass index and anticholinergic risk score) (feature-set 2), and the replacement of the 3-category MPI in feature-sets 1 and 2 by the eight separate MPI domains (feature-sets 3 and 4 respectively). To also assess prediction accuracy of ML algorithms using the same feature-sets.

Methods:

MPI and clinical features were collected in patients aged ≥65 years admitted to either General Medical or Acute Care of the Elderly wards of a South Australian hospital between September 2015 and February 2017. The diagnostic accuracy of LR-MLE was assessed together with nine ML algorithms: decision-trees, random-forests, eXtreme gradient-boosting (XGBoost), support-vector-machines, naïve-bayes, k-nearest-neighbours, ridge regression, logistic regression without regularisation and neural-networks. A 70:30 Training:Test split of the data and a grid-search of hyper-parameters with 10-fold cross-validation was employed during model training of the ML algorithms. Area-under-curve (AUC) was used to assess prediction accuracy.

Results:

A total of 737 patients (F:M=50.2%/49.8%) with median (IQR) age 80 (72-86) years had complete MPI data recorded on admission and complete 12-month follow-up obtained. The area-under-the receiver-operating-curve (AUC) for LR-MLE was 0.632, 0.688, 0.738 and 0.757 for feature-sets 1 to 4 respectively. The best overall accuracy for the nine ML algorithms was obtained using the XGBoost algorithm (0.635, 0.706, 0.756 and 0.757 for feature-sets 1 to 4 respectively).

Conclusions:

The use of MPI domains (feature-sets 3 and 4) with LR-MLE considerably improved prediction accuracy compared to that obtained using the traditional 3-category MPI. The XGBoost ML algorithm slightly improved accuracy compared to LR-MLE with feature-sets 1-3 but not with feature-set 4. Adding clinical data also provided small gains in accuracy for LR-MLE and some, but not all ML algorithms. Consideration should be given to using the underlying MPI domains of aggregate scoring systems, additional clinical data and ML algorithms when assessing the risk of 12MM. Clinical Trial: N/AMachine learning, Multidimensional Prognostic Index, mortality, diagnostic accuracy, XGBoost

Citation

Please cite as:

Woodman R, Bryant K, Sorich MJ, Pilotto A, Mangoni AA

Use of Multiprognostic Index Domain Scores, Clinical Data, and Machine Learning to Improve 12-Month Mortality Risk Prediction in Older Hospitalized Patients: Prospective Cohort Study

J Med Internet Res 2021;23(6):e26139

DOI: 10.2196/26139

PMID: 34152274

PMCID: 8277374

Download PDF

Request queued. Please wait while the file is being generated. It may take some time.

Copyright

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.

JMIR Publications

JMIR Preprints

Accepted for/Published in: Journal of Medical Internet Research

Date Submitted: Dec 1, 2020

Date Accepted: May 13, 2021

The effectiveness of using Multi-prognostic index (MPI) domain scores, clinical data and machine learning to improve 12-month mortality risk prediction in older hospitalized patients

ABSTRACT

Citation

Copyright