Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Currently accepted at: Journal of Medical Internet Research

Date Submitted: Aug 22, 2024
Open Peer Review Period: Aug 23, 2024 - Oct 18, 2024
Date Accepted: Dec 18, 2024
(closed for review but you can still tweet)

This paper has been accepted and is currently in production.

It will appear shortly on 10.2196/65681

The final accepted version (not copyedited yet) is in this tab.

Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.

Bridging Data Silos in Oncology with Modular Software for Federated Analysis on FHIR: A Multisite Implementation Study

  • Jasmin Ziegler; 
  • Marcel Erpenbeck; 
  • Timo Fuchs; 
  • Anna Saibold; 
  • Paul-Christian Volkmer; 
  • Günter Schmidt; 
  • Johanna Eicher; 
  • Peter Pallaoro; 
  • Renata De Souza Falguera; 
  • Fabio Aubele; 
  • Marlien Hagedorn; 
  • Ekaterina Vansovich; 
  • Johannes Raffler; 
  • Stephan Ringshandl; 
  • Alexander Kerscher; 
  • Julia Maurer; 
  • Brigitte Kühnel; 
  • Gerhard Schenkirsch; 
  • Marvin Kampf; 
  • Lorenz A. Kapsner; 
  • Hadieh Ghanbarian; 
  • Helmut Spengler; 
  • Iñaki Soto-Rey; 
  • Fady Albashiti; 
  • Dirk Hellwig; 
  • Maximilian Ertl; 
  • Georg Fette; 
  • Detlef Kraska; 
  • Martin Boeker; 
  • Hans-Ulrich Prokosch; 
  • Christian Gulden

ABSTRACT

Background:

Real-world data (RWD) from sources like administrative claims, electronic health records, and cancer registries offer insights into patient populations beyond the tightly regulated environment of randomized controlled trials. To leverage this and to advance cancer research, six university hospitals in Bavaria have established a joint research IT infrastructure.

Objective:

This article aims to outline the design, implementation, and deployment of a modular data transformation pipeline that transforms oncological RWD into HL7 (Health Level 7) FHIR (Fast Healthcare Interoperability Resources) format and then into a tabular format in preparation for a federated analysis (FA) across the six BZKF university hospitals.

Methods:

To harness RWD effectively, we designed a pipeline to convert the oncological basic dataset (oBDS) into HL7 FHIR format and prepare it for federated analysis. The pipeline handles diverse IT infrastructures and systems while maintaining privacy by keeping data decentralized for analysis. To assess the functionality and validity of our implementation, we defined a cohort to address two specific medical research questions. We evaluated our findings by comparing the results of the FA with reports from the Bavarian Cancer Registry and the original data from local tumor documentation systems.

Results:

We conducted a federated analysis of 17,885 cancer cases from 2021/2022. Breast cancer was the most common diagnosis at three sites, prostate cancer ranked in the top two at four sites, and malignant melanoma was notably prevalent. Gender-specific trends showed larynx and esophagus cancers were more common in males, while breast and thyroid cancers were more frequent in females. Discrepancies between the Bavarian Cancer Registry and our data, such as higher rates of malignant melanoma (5 % vs. 11 %) and lower representation of colorectal cancers (13 % vs. 7 %) likely result from differences in the time periods analyzed (2019 vs. 2021/2022) and the scope of data sources used. The Bavarian Cancer Registry reports approximately three times more cancer cases than the six university hospitals alone.

Conclusions:

The modular pipeline successfully transformed oncological RWD across six hospitals, and the federated approach preserved privacy while enabling comprehensive analysis. Future work will add support for recent oBDS versions, automate data quality checks, and integrate additional clinical data. Our findings highlight the potential of federated health data networks and lay the groundwork for future research that can leverage high-quality RWD, aiming to contribute valuable knowledge to the field of cancer research.


 Citation

Please cite as:

Ziegler J, Erpenbeck M, Fuchs T, Saibold A, Volkmer PC, Schmidt G, Eicher J, Pallaoro P, De Souza Falguera R, Aubele F, Hagedorn M, Vansovich E, Raffler J, Ringshandl S, Kerscher A, Maurer J, Kühnel B, Schenkirsch G, Kampf M, Kapsner LA, Ghanbarian H, Spengler H, Soto-Rey I, Albashiti F, Hellwig D, Ertl M, Fette G, Kraska D, Boeker M, Prokosch HU, Gulden C

Bridging Data Silos in Oncology with Modular Software for Federated Analysis on FHIR: A Multisite Implementation Study

JMIR Preprints. 22/08/2024:65681

DOI: 10.2196/preprints.65681

URL: https://preprints.jmir.org/preprint/65681

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.