Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: Journal of Medical Internet Research

Date Submitted: May 3, 2019
Open Peer Review Period: May 6, 2019 - Jul 1, 2019
Date Accepted: Mar 12, 2020
(closed for review but you can still tweet)

The final, peer-reviewed published version of this preprint can be found here:

Information Loss in Harmonizing Granular Race and Ethnicity Data: Descriptive Study of Standards

Wang K, Grossetta Nardini H, Post L, Edwards T, Nunez-Smith M, Brandt C

Information Loss in Harmonizing Granular Race and Ethnicity Data: Descriptive Study of Standards

J Med Internet Res 2020;22(7):e14591

DOI: 10.2196/14591

PMID: 32706693

PMCID: 7399950

Standards to harmonize granular race and ethnicity data: Meaningful information is lost in translation

  • Karen Wang; 
  • Holly Grossetta Nardini; 
  • Lori Post; 
  • Todd Edwards; 
  • Marcella Nunez-Smith; 
  • Cynthia Brandt

ABSTRACT

Background:

Data standards for collecting, storing, and indexing information on classification of social determinants of health, such as race and ethnicity, have significant ramifications in health equity research.

Objective:

We describe challenges encountered when working with multiple-race assessment in large health surveys in the Eastern Caribbean Health Outcomes Research Network (ECHORN), a collaborative of Barbados, Puerto Rico, Trinidad and Tobago, and U.S. Virgin Islands.

Methods:

We examined the data standards guiding research studies on race/ethnicity data collection and indexing, including Office of Management of Budget Directive 15 (OMB) and National Library Medicine’s Medical Subject Headings (MeSH), respectively, for a cohort study with multi-racial populations.

Results:

Among 1,211 participants in the ECHORN cohort study, 13% (n=117) selected Caribbean; 7.6% (n=58), Puerto Rican or Boricua; and 10% (n=122), multi-racial category. Over 18% selected two or more categories, with 15.2% (n=184) selecting two, and 2.6% (n=32) selecting three or more categories. With aggregation of ECHORN data into OMB categories, 24% of the participants are placed in the “more than one race” category. Moreover, searching for and retrieving articles related to multi-racial populations involves complicated keyword and synonyms searches and varies in number of articles retrieved (including terms such as biracial (n=863 articles), mixed race (n=6181articles), or multiracial (n=871 articles)).

Conclusions:

This analysis exposes the fundamental informatics challenges with current data standards, from data collection to indexing, that complicate meaningful collection and dissemination of accurate information for diverse and marginalized populations. Current standards should reflect the science of measuring race/ethnicity and the need for multi-disciplinary teams to improve evolving standards across the data lifecycle.


 Citation

Please cite as:

Wang K, Grossetta Nardini H, Post L, Edwards T, Nunez-Smith M, Brandt C

Information Loss in Harmonizing Granular Race and Ethnicity Data: Descriptive Study of Standards

J Med Internet Res 2020;22(7):e14591

DOI: 10.2196/14591

PMID: 32706693

PMCID: 7399950

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.