Accepted for/Published in: Journal of Medical Internet Research
Date Submitted: Dec 14, 2020
Date Accepted: Feb 18, 2021
Date Submitted to PubMed: Feb 22, 2021
Understanding Concerns, Sentiments and Disparities of Population Groups During the COVID-19 Pandemic: A Cross-Sectional Study Based on Large-Scale Twitter Mining
ABSTRACT
Background:
Since the outbreak of the COVID-19 pandemic in late 2019, its far-reaching impacts have been globally witnessed across all aspects of human life, such as health, economy, politics and education. Such widely penetrating impacts cast significant and profound burdens on all population groups, incurring varied concerns and sentiments among them.
Objective:
This study aims to discern the concerns, sentiments and disparities of various population groups during the COVID-19 pandemic through a cross-sectional study conducted via large-scale Twitter mining.
Methods:
The study conducted in this work consists of three steps: first, tweets posted during the pandemic were collected and preprocessed on a large scale; then the key population attributes, concerns, sentiments and emotions were extracted via a collection of natural language processing procedures; at last, multivariable analysis was conducted to reveal concerns, sentiments and disparities of population groups during the pandemic. Overall, this study implements a quick, effective and economical approach for analyzing population-level disparities during public health events. The source code developed in this study is released for free public use at https://github.com/cyzhang87/EmulatedQuestionnaireOnTwitter.
Results:
1,015,655 original English tweets posted between August 7 to 12, 2020, were acquired and analyzed to obtain the following results. Organizations are significantly more concerned about COVID-19 (OR=3.48 (95%CI: 3.39-3.58)) and have more ‘Fear’ and ‘Depression’ emotions than individuals. Females are less concerned about COVID-19 (OR=0.73 (95%CI: 0.71-0.75)) and have less ‘Fear’ and ‘Depression’ emotions than males. Among all age groups (below eighteen, nineteen to twenty-nine, thirty to thirty-nine, and above forty years old), the attention ORs of COVID-19, ‘Fear’ and ‘Depression’ increase significantly with the increase of age. It is worth noting that not all females pay less attention to COVID-19 than males. In the age group of above forty years old, females are more concerned than males, especially in the economic and education topics. Besides, males above forty and below eighteen years old are the least positive. Lastly, the sentiment polarities concerning political topics are the lowest among all population groups.
Conclusions:
Through large-scale Twitter mining, this study reveals that meaningful differences regarding concerns and sentiments on COVID-19 related topics exist among population groups during the study period. Therefore, specialized and varied attention and supports are in need for different population groups. In addition, the efficient analysis method implemented by our publicly released code can be utilized to dynamically track the evolution of each population group during the pandemic or any other major events for better informed public health research and intervention.
Citation
Request queued. Please wait while the file is being generated. It may take some time.
Copyright
© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.