JMIR Preprints #29455: Optimal policy determination in sequential systemic and locoregional therapy of oropharyngeal squamous carcinomas: A patient-physician digital twin dyad with deep Q-learning for treatment selection

Current Preprint Settings

(as selected by the authors)

1. When the manuscript is submitted, allow peer review from:

(a) Anybody (open community peer review)
(b) Editor-selected reviewers (closed peer review)

2. When the manuscript is submitted, display the preprint PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

3. When the manuscript is accepted, display the accepted manuscript PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

Optimal policy determination in sequential systemic and locoregional therapy of oropharyngeal squamous carcinomas: A patient-physician digital twin dyad with deep Q-learning for treatment selection

Elisa Tardini;
Xinhua Zhang;
Guadalupe Canahuate;
Andrew Wentzel;
Abdallah S. R. Mohamed;
Lisanne Van Dijk;
Clifton D. Fuller;
G. Elisabeta Marai

ABSTRACT

Purpose: Currently, selection of patients for sequential vs. concurrent chemotherapy/radiation regimens lacks evidentiary support, and it is based on locally-optimal decisions for each step. We aim to optimize the multi-step treatment of head and neck cancer patients and to predict multiple patient survival and toxicity outcomes, and we develop, apply, and evaluate a first application of deep-Q-learning (DQL) and simulation to this problem. Patients and methods: The treatment decision DQL digital twin and the patient’s digital twin were created, trained and evaluated on a dataset of 536 oropharyngeal squamous cell carcinoma (OPC) patients with the goal of, respectively, determining the optimal treatment decisions with respect to survival and toxicity metrics, and predicting the outcomes of the optimal treatment on the patient. The models were trained on a subset of 402 patients (split randomly) and evaluated on a separate set of 134 patients. Training and evaluation of the digital twin dyad was completed in August 2020. The dataset includes 3-step sequential treatment decisions and complete relevant history of the patients cohort treated at MD Anderson Cancer Center between 2005 and 2013, with radiomics analysis performed for the segmented primary tumor volumes.

Results:

On the validation set, 87.09% mean and 90.85% median accuracy in treatment outcome prediction, matching the clinicians’ outcomes and improving (predicted) survival rate by +3.73% (95% CI: [-0.75%, +8.96%]), and dysphagia rate by +0.75% (CI: [-4.48%, +6.72%]) when following DQL treatment decisions. Conclusion: Given the prediction accuracy and predicted improvement on medically relevant outcomes yielded by this approach, this digital twin dyad of the patient-physician dynamic treatment problem has the potential of aiding physicians in determining the optimal course of treatment and in assessing its outcomes.

Citation

Please cite as:

Tardini E, Zhang X, Canahuate G, Wentzel A, Mohamed ASR, Van Dijk L, Fuller CD, Marai GE

Optimal Treatment Selection in Sequential Systemic and Locoregional Therapy of Oropharyngeal Squamous Carcinomas: Deep Q-Learning With a Patient-Physician Digital Twin Dyad

J Med Internet Res 2022;24(4):e29455

DOI: 10.2196/29455

PMID: 35442211

PMCID: 9069283

Download PDF

Request queued. Please wait while the file is being generated. It may take some time.

Copyright

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.

JMIR Publications

JMIR Preprints

Accepted for/Published in: Journal of Medical Internet Research

Date Submitted: Apr 7, 2021

Open Peer Review Period: Apr 7, 2021 - Jun 2, 2021

Date Accepted: Feb 9, 2022

(closed for review but you can still tweet)

Optimal policy determination in sequential systemic and locoregional therapy of oropharyngeal squamous carcinomas: A patient-physician digital twin dyad with deep Q-learning for treatment selection

ABSTRACT

Citation

Copyright