JMIR Preprints #81424: Time-dynamic AI models to predict quality of life in breast cancer patients: Development and validation study using the EORTC BALANCE cohort

Current Preprint Settings

(as selected by the authors)

1. When the manuscript is submitted, allow peer review from:

(a) Anybody (open community peer review)
(b) Editor-selected reviewers (closed peer review)

2. When the manuscript is submitted, display the preprint PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

3. When the manuscript is accepted, display the accepted manuscript PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

Time-dynamic AI models to predict quality of life in breast cancer patients: Development and validation study using the EORTC BALANCE cohort

Niclas J Hubel;
Thijs G W van der Heijden;
Benjamin Murauer;
Belle H de Rooij;
Kelly M de Ligt;
Helena M Verkooijen;
Sofie AM Gernaat;
Meeke Hoedjes;
Volker Arndt;
Lonneke V van de Poll-Franse;
Bernhard Holzner;
Jens Lehmann

ABSTRACT

Background:

Breast cancer patients often experience health-related quality of life (HRQoL) impairments that remain difficult to predict on an individual level. Prediction models can aid to understand individual survivorship trajectories. However, current prognostic models are based on fixed intervals, limiting their utility in clinical follow-up schedules.

Objective:

This study aimed to develop and externally validate time-dynamic machine learning (ML) models that predict clinically relevant HRQoL impairments in non-metastatic breast cancer patients.

Methods:

Using the pooled multi-cohort EORTC BALANCE (big data in patients with breast cancer) dataset (N=6,316) containing repeated HRQoL measurements (EORTC QLQ-C30), we constructed over 70,000 patient assessment pairs. ML algorithms were trained using the earlier HRQoL assessment and clinical data to predict dichotomized impairments in QLQ-C30 domains at the later assessment between two weeks and five years ahead. The best performing model was determined via the Area Under the Receiver Operating Characteristic Curve (AUC) in the internal validation and externally validated in an independent cohort of the BALANCE dataset, in which the calibration and predictive performance in risk groups (patients: post menopause, with financial difficulties, with obesity, with 2 or more comorbidities, with lower educational status, with frailty) were also evaluated.

Results:

ML models showed good discrimination (AUC 0.64-0.84) across most domains, especially for persistent symptoms like fatigue, financial difficulties, or functioning scales. Gradient boosting models performed best, but tended to be overconfident, with poor calibration for low-prevalence symptoms like diarrhoea or constipation. Model performance varied by risk group (e.g., lower education, frailty), though no group consistently performed poorly. Performance remained stable across time windows, with prior HRQoL being the strongest predictor at the respective scale level, while clinical variables such as the type of treatment were less important for prediction.

Conclusions:

Time-dynamic ML models can support personalized HRQoL prediction in breast cancer care. Future improvements should focus on calibration and fairness to enable equitable, clinically meaningful implementation.

Citation

Please cite as:

Hubel NJ, van der Heijden TGW, Murauer B, de Rooij BH, de Ligt KM, Verkooijen HM, Gernaat SA, Hoedjes M, Arndt V, van de Poll-Franse LV, Holzner B, Lehmann J

Time-Dynamic AI Models to Predict Quality of Life in Patients With Breast Cancer: Development and Validation Study Using the EORTC BALANCE Cohort

J Med Internet Res 2026;28:e81424

DOI: 10.2196/81424

PMID: 42060909

PMCID: 13132481

Download PDF

Request queued. Please wait while the file is being generated. It may take some time.

Copyright

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.

JMIR Publications

JMIR Preprints

Accepted for/Published in: Journal of Medical Internet Research

Date Submitted: Aug 18, 2025

Date Accepted: Feb 4, 2026

Time-dynamic AI models to predict quality of life in breast cancer patients: Development and validation study using the EORTC BALANCE cohort

ABSTRACT

Citation

Copyright