Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR AI

Date Submitted: Mar 30, 2023
Open Peer Review Period: Mar 28, 2023 - May 23, 2023
Date Accepted: Feb 10, 2024
(closed for review but you can still tweet)

The final, peer-reviewed published version of this preprint can be found here:

Privacy-Preserving Federated Survival Support Vector Machines for Cross-Institutional Time-To-Event Analysis: Algorithm Development and Validation

Späth J, Sewald Z, Probul N, Berland M, Almeida M, Pons N, Le Chatelier E, Gines P, Sole C, Juanola A, Pauling J, Baumbach J

Privacy-Preserving Federated Survival Support Vector Machines for Cross-Institutional Time-To-Event Analysis: Algorithm Development and Validation

JMIR AI 2024;3:e47652

DOI: 10.2196/47652

PMID: 38875678

PMCID: 11041494

Privacy-Preserving Federated Survival Support Vector Machines for Cross-Institutional Time-To-Event Analysis: Implementation and Evaluation

  • Julian Späth; 
  • Zeno Sewald; 
  • Niklas Probul; 
  • Magali Berland; 
  • Mathieu Almeida; 
  • Nicolas Pons; 
  • Emmanuelle Le Chatelier; 
  • Pere Gines; 
  • Cristina Sole; 
  • Adrià Juanola; 
  • Josch Pauling; 
  • Jan Baumbach

ABSTRACT

Background:

Central collection of distributed patient data is problematic due to strict privacy regulations. Especially in clinical environments, such as clinical time-to-event studies, large sample sizes are critical but usually not available at a single institution. It has been shown recently that federated learning, combined with privacy-enhancing technologies, is an excellent and privacy-preserving alternative to data sharing.

Objective:

Many algorithms for federated time-to-event analysis, such as survival support vector machines, are still unavailable and not accessible to researchers and statisticians. Making these algorithms available for federated environments is critical for obtaining large sample sizes in cross-institutional time-to-event studies to comply with current privacy regulations.

Methods:

We extended the survival support vector machine algorithm to be applicable to federated environments. We further implemented it as a FeatureCloud app, enabling it to run in the federated infrastructure provided by the FeatureCloud platform. We evaluated our implemented algorithm on three benchmark datasets and a real-world microbiome dataset and compared it to the corresponding central method.

Results:

Our federated survival SVM produces highly similar results to the centralized model on all datasets. The maximal difference between the model weights of the central model and the federated model was only 0.001, and the mean difference over all datasets was 0.0002. We further show that by including more data in the analysis through federated learning, predictions are more accurate even in the presence of site-dependent batch effects.

Conclusions:

The federated survival support vector machine extends the palette of federated time-to-event analysis methods by a robust machine learning approach. To our knowledge, the implemented FeatureCloud app is the first publicly available implementation of a federated survival support vector machine, is freely accessible for all kinds of researchers, and can be directly used within the FeatureCloud platform.


 Citation

Please cite as:

Späth J, Sewald Z, Probul N, Berland M, Almeida M, Pons N, Le Chatelier E, Gines P, Sole C, Juanola A, Pauling J, Baumbach J

Privacy-Preserving Federated Survival Support Vector Machines for Cross-Institutional Time-To-Event Analysis: Algorithm Development and Validation

JMIR AI 2024;3:e47652

DOI: 10.2196/47652

PMID: 38875678

PMCID: 11041494

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.