Accepted for/Published in: JMIR Medical Informatics
Date Submitted: Dec 16, 2022
Date Accepted: Jun 8, 2023
Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.
Methods used in the development of Common Data Models for health data – a Scoping Review
ABSTRACT
Background:
Common Data Models (CDMs) are essential tools for data harmonization, which can lead to significant improvements in the health domains. CDMs unite data from disparate sources and ease collaborations across institutions that finally result in the generation of large standardized data repositories across different entities. An overview of existing CDMs and methods used to develop these datasets may assist the development process of future models for the health domain, e.g., for decision support systems.
Objective:
This Scoping Review (Sc-R) investigates methods used in the development of CDMs for health data. We aim to provide a broad overview of approaches and guidelines that are used in developing CDMs, i.e., Common Data Elements (CDEs) or Common Data Sets (CDS), for different health domains on an international level.
Methods:
This Sc-R is following the “Preferred Reporting Items for Systematic Review and Meta-Analysis extension for Scoping Reviews” (PRISMA-ScR) Checklist. We conducted our literature research in prominent databases, namely PubMed, Web of Science, Science Direct, and Scopus for five-year publications, starting from 2017 until March 2022. We identified and screened 801 articles. The included articles are evaluated based on the type of utilized method used in the conception, data collection, implementation, and evaluation phase of CDMs, and whether stakeholders (such as medical experts, patients, and IT Staff) were involved during this process. Moreover, the models are grouped into iterative or linear types based on the imperativeness of the stages during development.
Results:
We finally identified 33 articles that fit our eligibility criteria. Of these articles, 26 specifically focus on common medical conditions, five on rare medical conditions, and the remaining two are fitting to both categories. The development process usually involves stakeholders, but in different ways (e.g., working group meetings, Delphi, interviews, and questionnaires). 10 Models followed an iterative process.
Conclusions:
The articles included show the diversity of methods used to develop a CDM in different domains of health. We highlight the need for more specialized CDM development methods in the health domains and propose a suggestive development process that might ease the development process of the CDMs in the health domain in the future.
Citation
Request queued. Please wait while the file is being generated. It may take some time.
Copyright
© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.