JMIR Preprints #88838: Battling the bots: Defending against fraudulent responses while conducting an international community-engaged web-based survey with people living with Long COVID

Current Preprint Settings

(as selected by the authors)

1. When the manuscript is submitted, allow peer review from:

(a) Anybody (open community peer review)
(b) Editor-selected reviewers (closed peer review)

2. When the manuscript is submitted, display the preprint PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

3. When the manuscript is accepted, display the accepted manuscript PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

Battling the bots: Defending against fraudulent responses while conducting an international community-engaged web-based survey with people living with Long COVID

Kiera McDuff;
Tai-Te Su;
Darren A. Brown;
Jessica M. Martin;
Soo Chan Carusone;
Sarah O'Connell;
Imelda O'Donovan;
Natalie St. Clair-Sullivan;
Liam Townsend;
Susie Goulding;
Mary Kelly;
Lisa McCorkell;
Hannah Wei;
Margaret O'Hara;
Leticia Soares;
Lisa Avery;
Ciaran Bannan;
Colm Bergin;
Richard Harding;
Julia Nathanson;
Patricia Solomon;
Angela M. Cheung;
Jaimie Vera;
Kelly K. O'Brien

ABSTRACT

Background:

Web-based surveys involving self-reported questionnaires are vulnerable to fraudulent responses. Advancements in artificial intelligence (AI) and bots has introduced additional challenges to preventing and identifying fraudulent responses to online questionnaires.

Objective:

To describe our experiences with fraudulent responses, strategies for preventing and identifying fraudulent responses, lessons learned when conducting a web-based survey with adults living with Long COVID, and recommendations for web-based survey research.

Methods:

The Long COVID and Episodic Disability Study is an international community-engaged study among adults living with Long COVID in Canada, Ireland, United Kingdom (UK), and United States (US). We conducted a longitudinal web-based survey, with online administration of a self-reported questionnaire at two timepoints (Time One and Time Two), one week apart. We recruited through Long COVID community groups using social media, emails, and word of mouth. The survey was disrupted by fraudulent responses, including bots. To defend data integrity, we implemented the following strategies: a) pausing our initial launch (Wave One), b) developing and implementing screening criteria to identify fraudulent responses, and c) re-launching the web-based survey (Wave Two) with revised recruitment strategies and questionnaire design to prevent, and identify fraudulent responses.

Results:

We received 4663 responses for Time One and 1281 responses for Time Two, of which we retained 798/4663 (17%) and 629/1281 (49%). Strategies for preventing fraudulent responses included enabling survey protection features in survey software, shutting down compromised survey links, avoiding recruitment via public social media groups, and removing mention of a financial incentive from recruitment materials. Strategies for identifying fraudulent responses included monitoring response completion times, start and end time stamps, geolocation, and screening for suspicious email address characteristics and duplicates.

Conclusions:

Our lessons learned fell into three areas: 1) survey-design and implementation to prevent and identify fraudulent and bot-generated responses; 2) recruitment strategies to mitigate risk of disruption by bots; and 3) responding to disruptions caused by fraudulent and bot responses. We recommend the following tactics to prevent and mitigate the risks of fraudulent and bot responses when administering online web-based questionnaires: a) review current literature and connect with researchers and Research Ethics Boards (REBs) about strategies prior to launching; b) invest in survey software with rigorous info-security technology; c) employ bot-detection features available in survey software prior to launching; d) design questionnaire items to identify bots and fraudulent actors; e) tailor criteria for identifying fraudulent and bot responses to the characteristics of the target population; f) avoid recruitment in public social media groups; g) engage community leaders in tailored and targeted recruitment; h) avoid advertising incentives; i) shut down compromised links rapidly; j) communicate with the REB about disruptions; and k) combine automated with manual methods to identify potentially fraudulent responses in a timely manner. Clinical Trial: n/a

Citation

Please cite as:

McDuff K, Su TT, Brown DA, Martin JM, Chan Carusone S, O'Connell S, O'Donovan I, St. Clair-Sullivan N, Townsend L, Goulding S, Kelly M, McCorkell L, Wei H, O'Hara M, Soares L, Avery L, Bannan C, Bergin C, Harding R, Nathanson J, Solomon P, Cheung AM, Vera J, O'Brien KK

Battling the Bots and Defending Against Fraudulent Responses in an International Community-Engaged Web-Based Survey With People Living With Long COVID: Methodological Study

J Med Internet Res 2026;28:e88838

DOI: 10.2196/88838

PMID: 42492482

Download PDF

Request queued. Please wait while the file is being generated. It may take some time.

Copyright

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.

JMIR Publications

JMIR Preprints

Accepted for/Published in: Journal of Medical Internet Research

Date Submitted: Dec 9, 2025

Date Accepted: Jun 3, 2026

Battling the bots: Defending against fraudulent responses while conducting an international community-engaged web-based survey with people living with Long COVID

ABSTRACT

Citation

Copyright