Accepted for/Published in: Journal of Medical Internet Research
Date Submitted: Aug 19, 2019
Date Accepted: Aug 13, 2020
Data Quality Issues With Physician-Rating Websites: A Systematic Review
ABSTRACT
Background:
In recent years, online physician-rating websites have become prominent and exert considerable influence on patients’ decisions. However, the quality of these decisions depends on the quality of data that these systems collect. Thus, there is a need to examine the various data quality issues with physician-rating websites.
Objective:
This study’s objective was to identify and categorize the data quality issues afflicting physician-rating websites by reviewing the literature on online patient-reported physician ratings and reviews.
Methods:
We performed a systematic literature search in ACM Digital Library, EBSCO, Springer, PubMed, and Google Scholar. The search was limited to quantitative, qualitative, and mixed-methods papers published in the English language from 2001 to 2020.
Results:
A total of 423 articles were screened. From these, 49 papers describing 18 unique data quality issues afflicting physician-rating sites were included. Using a data quality framework, we classified these issues into four categories: intrinsic; contextual; representational; and accessible. 53% (26/49) of the papers reported intrinsic data quality errors, 61% (30/49) highlighted contextual data quality issues, 8% (4/49) discussed representational data quality issues, and 27% (13/49) emphasized accessibility data quality. More than half the papers discuss multiple categories of data quality issues.
Conclusions:
The results from this review demonstrate the presence of a range of data quality issues. While intrinsic and contextual factors have been well-researched, accessibility and representational issues warrant more attention from researchers, as well as practitioners. In particular, representational factors, such as the impact of inline advertisements and the positioning of positive reviews on the first few pages are usually deliberate and result from the physician-rating websites’ business models. The impact of these factors on data quality has not been addressed adequately and requires further investigation.
Citation
Request queued. Please wait while the file is being generated. It may take some time.
Copyright
© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.