JMIR Preprints #56237: Making metadata machine-readable as the first step to FAIR population health data

Current Preprint Settings

(as selected by the authors)

1. When the manuscript is submitted, allow peer review from:

(a) Anybody (open community peer review)
(b) Editor-selected reviewers (closed peer review)

2. When the manuscript is submitted, display the preprint PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

3. When the manuscript is accepted, display the accepted manuscript PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

Making metadata machine-readable as the first step to FAIR population health data

David Amadi;
Sylvia Kiwuwa-Muyingo;
Tathagata Bhattacharjee;
Amelia Taylor;
Agnes Kiragga;
Michael Ochola;
Chifundo Kanjala;
Arofan Gregor;
Keith Tomlin;
Jim Todd;
Jay Greenfield

ABSTRACT

Background:

Metadata describes and provides context for other data and plays a pivotal role in enabling the FAIR (Findability, Accessibility, Interoperability, and Reusability) data principles. By providing comprehensive and machine-readable descriptions of digital resources, metadata empowers both machines and human users to seamlessly discover, access, integrate, and reuse data or content across diverse platforms and applications. However, the limited accessibility and machine-interpretability of existing metadata for population health data hinder effective data discovery and reuse.

Objective:

To address these challenges, we propose a comprehensive framework utilizing standardized formats, vocabularies, and protocols to render population health data machine-readable, significantly enhancing its FAIRness and enabling seamless discovery, access, and integration across diverse platforms and research applications.

Methods:

The framework implements a three-stage approach: 1. DDI (data documentation initiative) Integration: Leveraging the DDI Codebook detailed and structured metadata is documented for data and associated assets, ensuring transparency and comprehensiveness. 2. OMOP CDM Standardization: Data is harmonized and standardized into the Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM), facilitating unified analysis across heterogeneous datasets. 3. Schema.org and JSON-LD Integration: Machine-readable metadata is generated using Schema.org entities and embedded within the data using JSON-LD, boosting discoverability and comprehension for both machines and human users We demonstrated the implementation of these three stages using the infectious disease surveillance and response (IDSR) data from Malawi and Kenya.

Results:

The implementation of our framework significantly enhanced the FAIRness of population health data, resulting in improved discoverability through seamless integration with platforms like Google Dataset Search. The adoption of standardized formats and protocols streamlined data accessibility and integration across various research environments, fostering collaboration and knowledge sharing. Additionally, the utilization of machine-interpretable metadata empowered researchers to efficiently reuse data for targeted analyses and insights, thereby maximizing the overall value of population health resources. The JSON-LD codes are accessible via GitHub repository, and the HTML code integrated with JSON-LD is available on the INSPIRE website.

Conclusions:

The adoption of machine-readable metadata standards is essential for ensuring the FAIRness of population health data. By embracing these standards, organizations can enhance diverse resource visibility, accessibility, and utility, leading to a broader impact, particularly in low- and middle-income countries (LMICs). Machine-readable metadata can accelerate research, improve healthcare decision-making, and ultimately promote better health outcomes for populations worldwide.

Citation

Please cite as:

Amadi D, Kiwuwa-Muyingo S, Bhattacharjee T, Taylor A, Kiragga A, Ochola M, Kanjala C, Gregor A, Tomlin K, Todd J, Greenfield J

Making Metadata Machine-Readable as the First Step to Providing Findable, Accessible, Interoperable, and Reusable Population Health Data: Framework Development and Implementation Study

Online J Public Health Inform 2024;16:e56237

DOI: 10.2196/56237

PMID: 39088253

PMCID: 11327634

Download PDF

Request queued. Please wait while the file is being generated. It may take some time.

Copyright

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.

JMIR Publications

JMIR Preprints

Accepted for/Published in: Online Journal of Public Health Informatics

Date Submitted: Jan 11, 2024

Date Accepted: May 19, 2024

Making metadata machine-readable as the first step to FAIR population health data

ABSTRACT

Citation

Copyright