Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: Journal of Medical Internet Research

Date Submitted: May 19, 2024
Date Accepted: Jan 26, 2025

The final, peer-reviewed published version of this preprint can be found here:

Challenges for Data Quality in the Clinical Data Life Cycle: Systematic Review

An D, Lim M, Lee S

Challenges for Data Quality in the Clinical Data Life Cycle: Systematic Review

J Med Internet Res 2025;27:e60709

DOI: 10.2196/60709

PMID: 40266662

PMCID: 12059509

Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.

Challenges of Data Quality in Clinical Data Life Cycle: A Systematic Review

  • Doyeon An; 
  • Minsik Lim; 
  • Suehyun Lee

ABSTRACT

Background:

It is anticipated that electronic health record (EHR) data will inform the development of health policy systems across countries and furnish valuable insights for the advancement of health and medical technology. As the current paradigm of clinical research is shifting toward data-centricity, the utilization of healthcare data is becoming increasingly emphasized.

Objective:

We aimed to review the literature on clinical data quality management and define a process for ensuring the quality management of clinical data, especially in the secondary utilization of data.

Methods:

A systematic review of PubMed articles from 2010 to October 2023 was conducted to assess the quality of electronic health record (EHR) and clinical data. Articles that defined quality management procedures based on the life cycle of clinical data quality management and discussed quality management assessment methods and tools were selected. The articles were categorized into four themes.

Results:

We reviewed 105 papers describing the clinical data quality management process. This process is based on a four-stage life cycle: planning, construction, operation, and utilization. The most frequently used dimensions were completeness, plausibility, concordance, security, currency, and interoperability.

Conclusions:

Given the importance of the secondary use of EHR data, standardized quality control methods and automation are necessary. This study proposes a process to standardize data quality management and develop a data quality assessment system.


 Citation

Please cite as:

An D, Lim M, Lee S

Challenges for Data Quality in the Clinical Data Life Cycle: Systematic Review

J Med Internet Res 2025;27:e60709

DOI: 10.2196/60709

PMID: 40266662

PMCID: 12059509

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.