Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: Journal of Medical Internet Research

Date Submitted: Sep 26, 2022
Open Peer Review Period: Sep 26, 2022 - Nov 21, 2022
Date Accepted: Jan 27, 2023
(closed for review but you can still tweet)

The final, peer-reviewed published version of this preprint can be found here:

Examining Rural and Urban Sentiment Difference in COVID-19–Related Topics on Twitter: Word Embedding–Based Retrospective Study

Liu Y, Yin Z, Ni C, Yan C, Wan Z, Malin B

Examining Rural and Urban Sentiment Difference in COVID-19–Related Topics on Twitter: Word Embedding–Based Retrospective Study

J Med Internet Res 2023;25:e42985

DOI: 10.2196/42985

PMID: 36790847

PMCID: 9937112

Examining Rural and Urban Sentiment Difference in COVID-19 Related Topics on Twitter: A Word Embedding Approach

  • Yongtai Liu; 
  • Zhijun Yin; 
  • Congning Ni; 
  • Chao Yan; 
  • Zhiyu Wan; 
  • Bradley Malin

ABSTRACT

Background:

By August 2022, more than 93 million people were infected with COVID-19 in the US, while the death rate in rural areas (325/100,000) was much higher than in urban areas (248/100,000). As the pandemic spread, people have used social media platforms to express their opinions and concerns about COVID-19 related topics.

Objective:

This study aimed to 1) identify the key COVID-19 related topics in the Contiguous US communicated over Twitter and 2) compare the sentiment about these topics between urban and rural users.

Methods:

We collected tweets containing geolocation data from May 2020 to January 2022 in the Contiguous US. We relied on tweet’s geolocation to determine if its author was in an urban or rural setting. We trained multiple word2vec models with several corpora of tweets based on geospatial and timing information. Using word2vec model built on all tweets, we identified hashtags relevant to COVID-19, and performed a hashtag clustering to obtain related topics. We then run an inference analysis for urban and rural sentiments with respect to a topic based on the similarity between topic hashtags and opinion adjectives in the corresponding urban and rural word2vec models. Finally, we analyzed the temporal trend in sentiment using monthly word2vec models.

Results:

In a corpus of 407 million tweets, 350 million (86.0%) were posted by users in metropolitan areas, while 18 million (4.4%) were posted by users in rural areas. There were 2,666 hashtags related to COVID-19, which clustered into 20 topics. Rural users expressed a stronger negative sentiment than urban users about COVID-19 prevention strategies and vaccination (P < .001). There was an apparent political divide in the perception of politicians by urban and rural users, who communicated a stronger negative sentiment about Republican and Democratic politicians, respectively (P < .001). Regarding misinformation and conspiracy theory, urban users exhibited a stronger negative sentiment about covidiots and China virus topics, while rural users exhibited a stronger negative sentiment about Dr. Fauci and plandemic topics. Finally, we observed that urban users’ sentiment about economy appeared a transition from negative to positive in late 2021, which was in line with the US economic recovery.

Conclusions:

This study demonstrates that there is a statistically significant differences in the sentiment of urban and rural Twitter users regarding a wide range of COIVD-19 related topics. This suggests that social media can potentially serve as the basis for a tool to monitor public sentiment during pandemics for disparate types of regions. In turn, this may assist in the geographically-targeted deployment of epidemic prevention and management efforts. Though this research focused on COVID-19 specifically, it can be readily reused to investigate other topics without additional data collection or model training.


 Citation

Please cite as:

Liu Y, Yin Z, Ni C, Yan C, Wan Z, Malin B

Examining Rural and Urban Sentiment Difference in COVID-19–Related Topics on Twitter: Word Embedding–Based Retrospective Study

J Med Internet Res 2023;25:e42985

DOI: 10.2196/42985

PMID: 36790847

PMCID: 9937112

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.