Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR Medical Informatics

Date Submitted: Dec 10, 2024
Date Accepted: Jul 29, 2025

The final, peer-reviewed published version of this preprint can be found here:

Lessons Learned From Building a Data Platform for Longitudinal, Analytical Use Cases and Scaling to 77 German Hospitals: Implementation Report

Bockhacker M, Martens P, von Münchow C, Löser S, Günther R, Kuhlen R, Kannt O, Ortleb S

Lessons Learned From Building a Data Platform for Longitudinal, Analytical Use Cases and Scaling to 77 German Hospitals: Implementation Report

JMIR Med Inform 2025;13:e69853

DOI: 10.2196/69853

PMID: 40939633

PMCID: 12431789

Implementation report and lessons learned from building a data platform for longitudinal, analytical use cases and scaling to 77 German hospitals

  • Markus Bockhacker; 
  • Peter Martens; 
  • Clara von Münchow; 
  • Sarah Löser; 
  • Rosita Günther; 
  • Ralf Kuhlen; 
  • Olaf Kannt; 
  • Sebastian Ortleb

ABSTRACT

Background:

Increasing adoption of electronic medical records (EHR) enables research on real-world data. In Germany this has been limited to university hospitals, data from acute care hospitals below university level is lacking. To address this gap we initiated the Helios Safe Medical Data-Platform (HeSaMeDa), which aggregates and standardizes pseudonymised EHR data with patients’ consent.

Objective:

To report on the design, implementation, patient participation and lessons learned during the scaling of a research platform to incorporate consented real-world data from 77 distinct hospitals into a unified data lake.

Methods:

Due to variations in EHR adoption, IT infrastructure, software vendors, interface availability and regulatory requirements, we used an agile development cycle that involves constant, incremental standardization of data. We implemented a layered lambda infrastructure built on Apache Hadoop. Decentralized connectors ensure data minimization and pseudonymization.

Results:

We successfully scaled our data model both laterally and horizontally in 77 hospitals. However, we encountered issues during the scaling of real-time data pipelines and IHE interfaces. During the first 2 years patients were asked to consent to secondary data use 1,475,244 times during inpatient admission. We registered 1,023,633 broad consents (consent rate: 70.2%).

Conclusions:

Patients are generally willing to provide consent for secondary use of their data, but obtaining consent requires considerable effort. Building a research data platform isn’t an end goal, but rather a necessary step in collecting and standardizing longitudinal data to enable research on real-world data. Through the combination of agile development, phased rollouts and very high levels of automation, we have been able to achieve fast turnaround times for incorporating user feedback and are constantly improving data quality and standardization.


 Citation

Please cite as:

Bockhacker M, Martens P, von Münchow C, Löser S, Günther R, Kuhlen R, Kannt O, Ortleb S

Lessons Learned From Building a Data Platform for Longitudinal, Analytical Use Cases and Scaling to 77 German Hospitals: Implementation Report

JMIR Med Inform 2025;13:e69853

DOI: 10.2196/69853

PMID: 40939633

PMCID: 12431789

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.