JMIR Preprints #63032: Comparing Health Survey Data Cost and Quality Between Amazon’s Mechanical Turk and Ipsos’ KnowledgePanel: An Observational Study

Current Preprint Settings

(as selected by the authors)

1. When the manuscript is submitted, allow peer review from:

(a) Anybody (open community peer review)
(b) Editor-selected reviewers (closed peer review)

2. When the manuscript is submitted, display the preprint PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

3. When the manuscript is accepted, display the accepted manuscript PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

Comparing Health Survey Data Cost and Quality Between Amazon’s Mechanical Turk and Ipsos’ KnowledgePanel: An Observational Study

Patricia M Herman;
Mary E Slaughter;
Nabeel Qureshi;
Tarek Azzam;
David Cella;
Ian D Coulter;
Graham DiGuiseppi;
Maria Orlando Edelen;
Arie Kapteyn;
Anthony Rodriguez;
Max Rubinstein;
Ron D Hays

ABSTRACT

Background:

Researchers have many options for online survey data collection, ranging from access to curated probability-based panels, where individuals are selectively invited to join based on their membership in a representative population, to convenience panels, which are open for anyone to join. The mix of respondents available also varies greatly regarding representation of a population of interest and in motivation to provide thoughtful and accurate responses. Despite the additional dataset-building labor required of the researcher, convenience panels are much less expensive than probability-based panels. However, it is important to understand what may be given up in terms of data quality for those cost savings.

Objective:

This study examined the relative costs and data quality from fielding equivalent surveys on Amazon’s Mechanical Turk (MTurk), a convenience panel that anyone 18 years or older with internet access and an electronic device can join, and on KnowledgePanel, a nationally representative probability-based panel whose members have been selectively invited to join.

Methods:

We administered the same survey measures to MTurk (2021) and KnowledgePanel (2022) members. We applied several quality assurance steps recommended in the literature at the time to enhance the data quality achieved using MTurk. Ipsos, the owner of KnowledgePanel, followed their usual (industry standard) protocols. The survey was designed to support psychometric analyses and included over 60 items from the Patient-Reported Outcomes Measurement Information System© (PROMIS®), demographics, and a list of health conditions. Those who indicated current back pain also received several established instruments used to measure back pain impact. Two fake conditions (“Syndomitis” and “Chekalism”) were used to identify those more likely to be honest respondents. We examined the quality of each platform’s data using several recommended metrics (e.g., consistency, reliability, representativeness, missing data, correlations) including and excluding those respondents who endorsed a fake condition.

Results:

We found that prescreening in the MTurk sample (removing those who endorsed a fake health condition) improved data quality, but KnowledgePanel data quality generally remained superior. While MTurk’s unweighted point estimates for demographics displayed the usual mismatch with national averages (younger, better educated, lower income), weighted MTurk data matched national estimates. KnowledgePanel’s point estimates better matched national benchmarks even before nonresponse weighting. Correlations between PROMIS measures and age and income were similar between MTurk and KnowledgePanel. However, correlations between PROMIS measures and education were dramatically different; the direction of over half of these relationships in the MTurk sample was opposite than expected from theory. Therefore, caution is needed if using MTurk for studies where education is a key variable.

Conclusions:

The data quality of our MTurk sample was often inferior to that of KnowledgePanel, but not so much as to negate the benefits of its cost savings for some uses. Clinical Trial: ClinicalTrials.gov NCT04426812

Citation

Please cite as:

Herman PM, Slaughter ME, Qureshi N, Azzam T, Cella D, Coulter ID, DiGuiseppi G, Edelen MO, Kapteyn A, Rodriguez A, Rubinstein M, Hays RD

Comparing Health Survey Data Cost and Quality Between Amazon’s Mechanical Turk and Ipsos’ KnowledgePanel: Observational Study

J Med Internet Res 2024;26:e63032

DOI: 10.2196/63032

PMID: 39612505

PMCID: 11645511

Download PDF

Request queued. Please wait while the file is being generated. It may take some time.

Copyright

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.

JMIR Publications

JMIR Preprints

Accepted for/Published in: Journal of Medical Internet Research

Date Submitted: Jun 7, 2024

Date Accepted: Oct 18, 2024

Comparing Health Survey Data Cost and Quality Between Amazon’s Mechanical Turk and Ipsos’ KnowledgePanel: An Observational Study

ABSTRACT

Citation

Copyright