Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: Journal of Medical Internet Research

Date Submitted: Dec 14, 2020
Date Accepted: Feb 18, 2021
Date Submitted to PubMed: Feb 22, 2021

The final, peer-reviewed published version of this preprint can be found here:

Understanding Concerns, Sentiments, and Disparities Among Population Groups During the COVID-19 Pandemic Via Twitter Data Mining: Large-scale Cross-sectional Study

Zhang C, Xu S, Li Z, Hu S

Understanding Concerns, Sentiments, and Disparities Among Population Groups During the COVID-19 Pandemic Via Twitter Data Mining: Large-scale Cross-sectional Study

J Med Internet Res 2021;23(3):e26482

DOI: 10.2196/26482

PMID: 33617460

PMCID: 7939057

Understanding Concerns, Sentiments and Disparities of Population Groups During the COVID-19 Pandemic: A Cross-Sectional Study Based on Large-Scale Twitter Mining

  • Chunyan Zhang; 
  • Songhua Xu; 
  • Zongfang Li; 
  • Shunxu Hu

ABSTRACT

Background:

Since the outbreak of the COVID-19 pandemic in late 2019, its far-reaching impacts have been globally witnessed across all aspects of human life, such as health, economy, politics and education. Such widely penetrating impacts cast significant and profound burdens on all population groups, incurring varied concerns and sentiments among them.

Objective:

This study aims to discern the concerns, sentiments and disparities of various population groups during the COVID-19 pandemic through a cross-sectional study conducted via large-scale Twitter mining.

Methods:

The study conducted in this work consists of three steps: first, tweets posted during the pandemic were collected and preprocessed on a large scale; then the key population attributes, concerns, sentiments and emotions were extracted via a collection of natural language processing procedures; at last, multivariable analysis was conducted to reveal concerns, sentiments and disparities of population groups during the pandemic. Overall, this study implements a quick, effective and economical approach for analyzing population-level disparities during public health events. The source code developed in this study is released for free public use at https://github.com/cyzhang87/EmulatedQuestionnaireOnTwitter.

Results:

1,015,655 original English tweets posted between August 7 to 12, 2020, were acquired and analyzed to obtain the following results. Organizations are significantly more concerned about COVID-19 (OR=3.48 (95%CI: 3.39-3.58)) and have more ‘Fear’ and ‘Depression’ emotions than individuals. Females are less concerned about COVID-19 (OR=0.73 (95%CI: 0.71-0.75)) and have less ‘Fear’ and ‘Depression’ emotions than males. Among all age groups (below eighteen, nineteen to twenty-nine, thirty to thirty-nine, and above forty years old), the attention ORs of COVID-19, ‘Fear’ and ‘Depression’ increase significantly with the increase of age. It is worth noting that not all females pay less attention to COVID-19 than males. In the age group of above forty years old, females are more concerned than males, especially in the economic and education topics. Besides, males above forty and below eighteen years old are the least positive. Lastly, the sentiment polarities concerning political topics are the lowest among all population groups.

Conclusions:

Through large-scale Twitter mining, this study reveals that meaningful differences regarding concerns and sentiments on COVID-19 related topics exist among population groups during the study period. Therefore, specialized and varied attention and supports are in need for different population groups. In addition, the efficient analysis method implemented by our publicly released code can be utilized to dynamically track the evolution of each population group during the pandemic or any other major events for better informed public health research and intervention.


 Citation

Please cite as:

Zhang C, Xu S, Li Z, Hu S

Understanding Concerns, Sentiments, and Disparities Among Population Groups During the COVID-19 Pandemic Via Twitter Data Mining: Large-scale Cross-sectional Study

J Med Internet Res 2021;23(3):e26482

DOI: 10.2196/26482

PMID: 33617460

PMCID: 7939057

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.