Accepted for/Published in: JMIR Medical Informatics
Date Submitted: Aug 17, 2025
Open Peer Review Period: Aug 17, 2025 - Oct 12, 2025
Date Accepted: Dec 2, 2025
(closed for review but you can still tweet)
A machine learning model based on clinical factors to predict the efficacy of first-line immunochemotherapy for patients with advanced gastric cancer: A retrospective study
ABSTRACT
Background:
The development of immunotherapy has provided new hope for patients with advanced gastric cancer (AGC). However, due to the high heterogeneity of the disease, the efficacy of first-line immunochemotherapy varies among patients. There is still a lack of simple and effective models to predict the efficacy of immunochemotherapy in this setting.
Objective:
This study aims to identify critical factors and develop predictive models to evaluate the efficacy of first-line immunochemotherapy in AGC patients using clinically available data. The goal is to offer evidence-based guidance for clinical practice and enable personalized treatment strategies.
Methods:
To evaluate the effectiveness of first-line immunochemotherapy in AGC, we retrospectively collected clinical data from The First Affiliated Hospital of Nanjing Medical University between January 2018 and October 2023. The collected data were divided into a training set (70%) and an internal validation set (30%). Additionally, a temporal validation cohort of 76 patients recruited from November 2023 to September 2024 was assembled to further evaluate the predictive performance of the models. We used univariate and multivariate Cox regression analyses, along with the Least Absolute Shrinkage and Selection Operator (LASSO) regression, and integrated clinical expertise to identify key predictors of treatment efficacy and to construct the LASSO-Cox model. We developed four models (LASSO-Cox, RSF, XGBoost, and Survival SVM) and evaluated their performance using the C-index, AUC, calibration curves, and Decision Curve Analysis (DCA). The optimal model was interpreted using Shapley Additive Explanations (SHAP), and its risk scores were used to stratify patients for Kaplan-Meier survival analysis.
Results:
Among the four prognostic models developed in this study, the RSF model demonstrated superior predictive accuracy and discrimination for progression-free survival (PFS), as evidenced by its higher AUC, C-index, continuous AUC curves, and calibration curves compared with the other three models. Additionally, DCA showed that the RSF model offered greater net clinical benefit. The SHAP results revealed that Age, Histological Subtype, the proportion of CD19+ B cells, CD16+CD56+ natural killer (NK) cells and the presence of liver metastasis as key prognostic factors influencing patient outcomes. Patients in the low-risk group, as determined by the RSF model’s risk score, exhibited a significantly higher PFS rate than those in the high-risk group, further validating the value of the RSF model for risk stratification.
Conclusions:
This study is the first to employ machine learning algorithms to develop a predictive model for the efficacy of first-line immunochemotherapy in AGC, and to identify key predictors of treatment outcome. The results indicate that the RSF model not only enables precise stratification of patients likely to benefit, but more importantly, provides quantifiable decision support for individualized clinical strategies, underscoring its potential value in clinical decision-making.
Citation
Request queued. Please wait while the file is being generated. It may take some time.
Copyright
© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.