Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: Journal of Medical Internet Research

Date Submitted: Jun 17, 2022
Date Accepted: Nov 13, 2022

The final, peer-reviewed published version of this preprint can be found here:

Ethical and Methodological Considerations of Twitter Data for Public Health Research: Systematic Review

Takats C, Kwan A, Wormer R, Goldman D, Jones HE, Romero D

Ethical and Methodological Considerations of Twitter Data for Public Health Research: Systematic Review

J Med Internet Res 2022;24(11):e40380

DOI: 10.2196/40380

PMID: 36445739

PMCID: 9748795

Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.

Ethical Considerations and Methodological Uses of Twitter Data for Public Health Research: A Systematic Review

  • Courtney Takats; 
  • Amy Kwan; 
  • Rachel Wormer; 
  • Dari Goldman; 
  • Heidi E Jones; 
  • Diana Romero

ABSTRACT

Background:

Much research is being done using publicly available Twitter data in the field of public health, but what types of research questions these data are being used to answer and the extent to which these projects require ethical oversight is not clear.

Objective:

To describe the current state of public health research using Twitter data in terms of methods/research questions, geographic focus, and ethical considerations including informed consent of Twitter handlers.

Methods:

We implemented a systematic review, following PRISMA guidelines, of articles published between January 2006 and October 31, 2019 using Twitter data in secondary analyses for public health research found using standardized search criteria on SOCIndex, PsychInfo, and/or PubMed. Studies were excluded when using Twitter for primary data collection, such as for study recruitment or as part of a dissemination intervention.

Results:

We identified 367 articles that met eligibility criteria. Infectious disease (21.8%) and substance use (18.0%) were the most common topics for these studies, and sentiment mining (61.9%), surveillance (61.0%), and thematic exploration (59.1%) were the most common methodologies employed. About one-third of articles had a global/worldwide geographic focus; another third focused on the United States. The majority (60.5%) of articles used a native Twitter application programming interface (API), and a significant amount of the remainder (27.8%) used a third-party API. Only one third (32.3%) of studies sought IRB approval, while 16.9% included identifying information on Twitter users and/or tweets and 35.7% attempted to anonymize identifiers. Most studies included discussion of the validity of the measures (73.6%) and reliability of coding (69.7% for inter-reliability of human coding and 70.2% for computer algorithm checks), but less attention was paid to the sampling frame, and what underlying population the sample represented.

Conclusions:

Twitter data may be useful in public health research, given its access to publicly available information. However, studies should exercise greater caution in considering the data sources, accession method, and external validity of the sampling frame. Further, an ethical framework is necessary to help guide future research in this area, especially when individual, identifiable Twitter users and tweets are shared and discussed.


 Citation

Please cite as:

Takats C, Kwan A, Wormer R, Goldman D, Jones HE, Romero D

Ethical and Methodological Considerations of Twitter Data for Public Health Research: Systematic Review

J Med Internet Res 2022;24(11):e40380

DOI: 10.2196/40380

PMID: 36445739

PMCID: 9748795

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.