Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR Public Health and Surveillance

Date Submitted: Dec 20, 2022
Open Peer Review Period: Dec 20, 2022 - Jan 3, 2023
Date Accepted: Jun 13, 2023
Date Submitted to PubMed: Jun 13, 2023
(closed for review but you can still tweet)

The final, peer-reviewed published version of this preprint can be found here:

Prediction Model for Postoperative Quality of Life Among Breast Cancer Survivors Along the Survivorship Trajectory From Pretreatment to 5 Years: Machine Learning–Based Analysis

Kang D, Kim H, Cho J, Kim Z, Chung M, Lee JE, Nam SJ, Kim SW, Yu J, Chae BJ, Ryu JM, Lee SK

Prediction Model for Postoperative Quality of Life Among Breast Cancer Survivors Along the Survivorship Trajectory From Pretreatment to 5 Years: Machine Learning–Based Analysis

JMIR Public Health Surveill 2023;9:e45212

DOI: 10.2196/45212

PMID: 37309655

PMCID: 10485708

Prediction model for postoperative quality of life among breast cancer survivors along the survivorship trajectory from pretreatment to 5 years: Machine learning-based analysis

  • Danbee Kang; 
  • Hyunsoo Kim; 
  • Juhee Cho; 
  • Zero Kim; 
  • Myungjin Chung; 
  • Jeong Eon Lee; 
  • Seok Jin Nam; 
  • Seok Won Kim; 
  • Jonghan Yu; 
  • Byung Joo Chae; 
  • Jai Min Ryu; 
  • Se Kyung Lee

ABSTRACT

Background:

Breast cancer is the most common cancer and cause of cancer death in women. Although survival rates have improved, unmet psychosocial needs remain challenging because the quality of life (QoL) and QoL-related factors change over time. In addition, traditional statistical models have limitations in identifying factors associated with QoL over time, particularly concerning the physical, psychological, economic, spiritual, and social dimensions.

Objective:

This study aimed to identify patient-centered factors associated with QoL among breast cancer patients using a machine learning algorithm to analyze data collected along different survivorship trajectories.

Methods:

The study used two datasets: the first data set was the cross-sectional survey data from the Breast cancer Information Grand round for Survivorship (BIG-S) study, which recruited consecutive breast cancer survivors who visited the outpatient breast cancer clinic at the Samsung Medical Center in Seoul, Korea, between 2018 and 2019. The second data set was the longitudinal cohort data from the Beauty Education for diStressed breasT cancer (BEST) cohort study, which was conducted at two university-based cancer hospitals in Seoul, Korea between 2011 and 2016. QoL was measured using EORTC QLQ-C30 questionnaire. Feature importance was interpreted using Shapley Additive Explanations (SHAP). The final model was selected based on the highest mean area under the receiver operating characteristic curve (AUC). The analyses were performed using the Python 3.7, scikit-learn package, and TensorFlow Keras framework.

Results:

The study included 6,265 breast cancer survivors in the training dataset and 432 patients in the validation set. Mean age was 50.6 years and 46.8% had stage 1 cancer. In the training dataset, 48.3% survivors had poor QoL. The study developed machine learning models for QoL prediction based on six algorithms. Performance was good for all survival trajectories: overall (AUC = 0.823), baseline (AUC = 0.835), under 1 year (AUC = 0.860), between 2 and 3 years (AUC = 0.808), between 3 and 4 years (AUC = 0.820), and between 4 and 5 years (AUC = 0.826). Emotional and physical functions were the most important features before surgery and under 1 year after surgery, respectively. Fatigue was the most important feature between 1–4 years. Despite the survival period, hopefulness was the most influential feature on QoL. External validation of the models showed good performance with AUCs between 0.770 and 0.862.

Conclusions:

The study identified important factors associated with QoL among breast cancer survivors across different survival trajectories. Understanding the changing trends of these factors could help to intervene more precisely and timely, and potentially prevent or alleviate QoL-related issues for patients. The good performance of our machine learning models in both training and external validation sets suggests the potential utility of this approach in identifying patient-centered factors and improving survivorship care.


 Citation

Please cite as:

Kang D, Kim H, Cho J, Kim Z, Chung M, Lee JE, Nam SJ, Kim SW, Yu J, Chae BJ, Ryu JM, Lee SK

Prediction Model for Postoperative Quality of Life Among Breast Cancer Survivors Along the Survivorship Trajectory From Pretreatment to 5 Years: Machine Learning–Based Analysis

JMIR Public Health Surveill 2023;9:e45212

DOI: 10.2196/45212

PMID: 37309655

PMCID: 10485708

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.