Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: Journal of Medical Internet Research

Date Submitted: Jan 16, 2025
Date Accepted: May 12, 2025

The final, peer-reviewed published version of this preprint can be found here:

The Lifecycle of Electronic Health Record Data in HIV-Related Big Data Studies: Qualitative Study of Bias Instances and Potential Opportunities for Minimization

N'Diaye A, Qiao S, Garrett C, Khushf G, Zhang J, Li X, Olatosi B

The Lifecycle of Electronic Health Record Data in HIV-Related Big Data Studies: Qualitative Study of Bias Instances and Potential Opportunities for Minimization

J Med Internet Res 2025;27:e71388

DOI: 10.2196/71388

PMID: 40773672

PMCID: 12331130

The Lifecycle of Electronic Health Record Data in HIV-Related Big Data Studies: A Qualitative Study of Instances of and Potential Opportunities to Minimize Bias

  • Arielle N'Diaye; 
  • Shan Qiao; 
  • Camryn Garrett; 
  • George Khushf; 
  • Jiajia Zhang; 
  • Xiaoming Li; 
  • Bankole Olatosi

ABSTRACT

Background:

Electronic health record (EHR) data are widely used in public health research, including HIV-related studies, but are limited by potential bias due to incomplete and inaccurate information, lack of generalizability, and lack of representativeness.

Objective:

This study explores how workflow processes within HIV clinics, among data scientists, and within state health departments may introduce and minimize bias within EHRs.

Methods:

Using a constructivist grounded theory approach, in-depth individual interviews were conducted with 16 participants purposively sampled in South Carolina from August 2023-April 2024. A focus group with 3 health department professionals with expertise in HIV disease surveillance was also conducted. Analysis was conducted as outlined by Charmaz (2006).

Results:

To reduce bias in EHR data, information entry forms should be designed to expansively include patient self-reported social determinants of health (SDOH) information. During data collection, healthcare providers should create a supportive healthcare environment, facilitate SDOH information disclosure, and accurately document patient information. Patients should have access to their EHRs to confirm that SDOH information are correctly recorded. During data curation, data scientists should inspect datasets for completeness, accuracy, and educate public health researchers on dataset limitations. During data management and utilization, health department professionals should crossmatch data across the state, customize data collection systems to reflect local needs, and provide community-based data education and stigma management.

Conclusions:

Study results suggest that future research is needed to understand how healthcare systems can be incentivized to create and implement EHR bias reduction strategies across all workflows and between stakeholders.


 Citation

Please cite as:

N'Diaye A, Qiao S, Garrett C, Khushf G, Zhang J, Li X, Olatosi B

The Lifecycle of Electronic Health Record Data in HIV-Related Big Data Studies: Qualitative Study of Bias Instances and Potential Opportunities for Minimization

J Med Internet Res 2025;27:e71388

DOI: 10.2196/71388

PMID: 40773672

PMCID: 12331130

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.