Currently submitted to: JMIR Medical Informatics
Date Submitted: Mar 3, 2026
Open Peer Review Period: Mar 13, 2026 - May 8, 2026
(currently open for review)
Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.
DCAT-based metadata schema for a National Health Data Catalogue
ABSTRACT
Background:
Rich, standardised metadata is essential for improving the findability, accessibility, interoperability, and reusability (FAIR) of health research resources. The European Health Data Space (EHDS) requires harmonised catalogue metadata across countries, yet national implementations demonstrating how DCAT‑based standards can be operationalised in practice are still limited.
Objective:
To develop, document, and demonstrate a DCAT‑based metadata schema for a National Health Data Catalogue, using the Dutch Health‑RI National Health Data Infrastructure as a concrete implementation example, and to assess how this approach supports interoperability and FAIR‑aligned metadata publication.
Methods:
Metadata requirements were gathered from Dutch health institutions and mapped to DCAT, DCAT‑AP, HealthDCAT‑AP and DCAT‑AP‑NL. A multi‑stakeholder modelling process involving semantic experts, ontology engineers and data stewards produced a core schema and domain‑specific extensions. RDF and SHACL were used for validation, and FAIR Data Points enabled decentralised metadata publication. Several pilot use cases were onboarded to evaluate applicability, interoperability and usability in real‑world settings.
Results:
The resulting schema comprises DCAT‑aligned classes and expanded mandatory and recommended fields aligned with HealthDCAT‑AP, the metadata model supported by EHDS. The model supports both general catalogue metadata and evolving domain‑specific extensions. Pilot implementations across Dutch institutions demonstrated improved metadata consistency, enhanced resource discoverability and successful interoperability between local FAIR Data Points and the National Health Data Catalogue.
Conclusions:
The developed DCAT‑based schema provides a scalable, standards‑aligned foundation for a National Health Data Catalogue and supports cross‑infrastructure interoperability mandated by the EHDS. The Dutch implementation shows how structured metadata and coordinated national governance can enhance FAIRness and improve access to health research resources. The approach offers a practical template for national Health Data Access Bodies across Europe. Future work includes completing domain‑specific extensions, increasing automation in metadata generation and validation, and strengthening documentation and training to support sustainable, community‑driven adoption.
Citation
Request queued. Please wait while the file is being generated. It may take some time.
Copyright
© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.