Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: Journal of Medical Internet Research

Date Submitted: Apr 12, 2024
Date Accepted: Sep 24, 2024

The final, peer-reviewed published version of this preprint can be found here:

Long COVID Discourse in Canada, the United States, and Europe: Topic Modeling and Sentiment Analysis of Twitter Data

AbuRaed AGT, Prikryl EA, Tang L, Carenini G, Janjua NZ

Long COVID Discourse in Canada, the United States, and Europe: Topic Modeling and Sentiment Analysis of Twitter Data

J Med Internet Res 2024;26:e59425

DOI: 10.2196/59425

PMID: 39652387

PMCID: 11667136

Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.

Tracking Long COVID Discourse on Twitter: A Comparative Study across Canada, the US, and English-Speaking Europe (including the UK), Using Topic Modeling and Sentiment Analysis

  • Ahmed Ghassan Tawfiq AbuRaed; 
  • Emil Azuma Prikryl; 
  • Lisa Tang; 
  • Giuseppe Carenini; 
  • Naveed Zafar Janjua

ABSTRACT

Background:

Social media serves as a vast repository of data, offering insights into the perceptions and emotions surrounding significant societal issues. Amidst the COVID-19 pandemic, Long COVID has emerged as a chronic health condition, profoundly affecting numerous lives and livelihoods. Given the dynamic nature of Long COVID and our evolving understanding of it, effectively capturing people's sentiments and perceptions through social media becomes increasingly crucial. By harnessing the wealth of data available on social platforms, we can better track the evolving narrative surrounding Long COVID and the collective efforts to address this pressing issue.

Objective:

We aimed to investigate people’s perceptions and sentiments around Long COVID in Canada, the US, and English-speaking Europe, including the UK.

Methods:

We analyzed Long COVID–related tweets from 2021 using topic modeling and sentiment analysis, and interpreted the results with public health experts. We compared timelines of topics discussed across the three English-speaking regions, and also explored sentiments associated with the tweets.

Results:

Topic modeling identified five topics, and public health experts provided interpretations of the topics based on visualization, top-ranked salient terms, and representative tweets for each topic. The interpretation for each topic was not entirely distinct from one topic to the next, resulting in difficulty identifying clear patterns when analyzing the prominence of these topics over time. However, there appeared to be an overall increasing trend in the topic of duration and suffering associated with Long COVID across the regions, consistent with the increasing burden of and awareness around Long COVID as the pandemic progressed and the number of people living with Long COVID increased.

Conclusions:

Analysis aided by natural language processing techniques and domain expert input has the potential to produce useful insights for public health monitoring and action. This study explored the use of topic modeling and sentiment analysis on Long COVID–related tweets in Canada, the US, and English-speaking Europe. This kind of information could serve as one input in public health monitoring and surveillance systems by providing a high-level understanding of the frequency and predominance of discussions and sentiments pertaining to public health issues such as Long COVID, as well as to future efforts to address such issues.


 Citation

Please cite as:

AbuRaed AGT, Prikryl EA, Tang L, Carenini G, Janjua NZ

Long COVID Discourse in Canada, the United States, and Europe: Topic Modeling and Sentiment Analysis of Twitter Data

J Med Internet Res 2024;26:e59425

DOI: 10.2196/59425

PMID: 39652387

PMCID: 11667136

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.