JMIR Preprints #60847: Advancing Privacy-Preserving Healthcare Analytics: Implementation of the Personal Health Train for Federated Deep Learning


Advancing Privacy-Preserving Healthcare Analytics: Implementation of the Personal Health Train for Federated Deep Learning

Ananya Choudhury;
Leroy Volmer;
Frank Martin;
RRR Fijten;
Leonard Wee;
Andre Dekker;
Johan van Soest

ABSTRACT

Background:

Accurate delineation of the gross tumor volume (GTV) is crucial in radiotherapy for dose calculation and precise image-guided treatment of patients with lung cancer. Conventionally, this task has been performed manually by radiation oncologists, which makes it subjective and variable among clinicians. Deep learning has enabled automated GTV segmentation, with the potential to improve the efficiency and consistency of the radiotherapy workflow, ultimately enhancing patient outcomes while reducing clinician workload. However, the adoption of deep learning-based GTV segmentation tools is hindered by data privacy constraints and by the need for large, diverse datasets from multiple institutions. Federated learning (FL) offers a promising solution, allowing collaborative development of AI models without the need to share individual subject-level data.

Objective:

This study introduces a federated learning infrastructure called the Personal Health Train (PHT), which includes the procedural, technical, and governance components needed to implement federated learning on real-world healthcare data, including the training of deep neural networks. The study applies this federated deep learning infrastructure to the use case of gross tumor volume (GTV) segmentation on chest CT images of patients with lung cancer and presents the results of a proof-of-concept experiment.

Methods:

The PHT framework addresses the privacy concerns of data sharing by keeping data close to the source and sending the analysis to the data instead. Technologically, PHT requires three interdependent components: "tracks" (protected communication channels), "trains" (containerized software applications), and "stations" (institutional data repositories), which are supported by the open source "Vantage6" software. The study applies this federated deep learning infrastructure to the use case of GTV segmentation on chest CT images of patients with lung cancer, with the introduction of an additional component called the Secure Aggregation Server, where the model averaging is done in a trusted and inaccessible environment.
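
The model averaging step described above can be illustrated with a minimal sketch. This is not the authors' implementation and does not use the Vantage6 API: the function name `federated_average`, the dictionary-of-lists weight format, and the weighting of each station by its number of training samples (the common FedAvg scheme) are all assumptions made for illustration, since the abstract only states that model averaging takes place on the Secure Aggregation Server.

```python
from typing import Dict, List, Sequence

# Hypothetical weight format: parameter name -> flat list of values.
Weights = Dict[str, List[float]]


def federated_average(
    station_weights: Sequence[Weights],
    station_sizes: Sequence[int],
) -> Weights:
    """Average model weights from several stations.

    Each station's contribution is weighted by its number of training
    samples (an assumed FedAvg-style scheme). Only model parameters are
    passed in; no patient-level data is involved.
    """
    if not station_weights or len(station_weights) != len(station_sizes):
        raise ValueError("need one sample count per station")
    total = sum(station_sizes)
    if total <= 0:
        raise ValueError("total sample count must be positive")

    averaged: Weights = {}
    for name, reference in station_weights[0].items():
        averaged[name] = [
            sum(w[name][i] * n for w, n in zip(station_weights, station_sizes)) / total
            for i in range(len(reference))
        ]
    return averaged


if __name__ == "__main__":
    # Two hypothetical stations holding 1 and 3 training samples.
    hospital_a = {"layer": [1.0, 2.0]}
    hospital_b = {"layer": [3.0, 6.0]}
    print(federated_average([hospital_a, hospital_b], [1, 3]))
```

In a setup like the one described, each station would train locally and send only its updated weights over the protected channel; the aggregated weights would then be returned to the stations for the next training round.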

Results:

We demonstrated the feasibility of executing deep learning algorithms in a federated manner using PHT and present the results of a proof-of-concept study. The infrastructure linked 12 hospitals in 8 nations across 4 continents, demonstrating the scalability and global reach of the proposed approach. Throughout the execution and training of the deep learning algorithm, no data were shared outside the participating hospitals.

Conclusions:

The findings of the proof-of-concept study, as well as the implications and limitations of the infrastructure and the results, are discussed. The application of federated deep learning to unstructured medical imaging data, facilitated by the PHT framework and Vantage6 platform, represents a significant advancement in the field. The proposed infrastructure addresses the challenges of data privacy and enables collaborative model development, paving the way for the widespread adoption of deep learning-based tools in the medical domain and beyond. The introduction of the Secure Aggregation Server suggests that data leakage problems in federated learning can be prevented by careful infrastructure design decisions.

Clinical Trial:

ARtificial Intelligence for Gross Tumour vOlume Segmentation (ARGOS); ClinicalTrials.gov ID: NCT05775068; Sponsor: Maastricht Radiation Oncology; Information provided by: Andre Dekker, Maastricht Radiation Oncology (Responsible Party); https://clinicaltrials.gov/study/NCT05775068

Citation

Please cite as:

Choudhury A, Volmer L, Martin F, Fijten R, Wee L, Dekker A, Soest Jv

Advancing Privacy-Preserving Health Care Analytics and Implementation of the Personal Health Train: Federated Deep Learning Study

JMIR AI 2025;4:e60847

DOI: 10.2196/60847

PMID: 39912580

PMCID: 11843053

Copyright

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on its website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a CC BY license on publication, at this stage authors and publisher expressly prohibit redistribution of this draft paper other than for review purposes.

Accepted for/Published in: JMIR AI

Date Submitted: May 23, 2024

Open Peer Review Period: Jun 13, 2024 - Aug 8, 2024

Date Accepted: Oct 17, 2024
