Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: Journal of Medical Internet Research

Date Submitted: May 22, 2020
Date Accepted: Oct 28, 2020
Date Submitted to PubMed: Oct 29, 2020

The final, peer-reviewed published version of this preprint can be found here:

Twitter Discussions and Emotions About the COVID-19 Pandemic: Machine Learning Approach

Xue J, Chen J, Hu R, Chen C, Zheng C, Su Y, Zhu T

Twitter Discussions and Emotions About the COVID-19 Pandemic: Machine Learning Approach

J Med Internet Res 2020;22(11):e20550

DOI: 10.2196/20550

PMID: 33119535

PMCID: 7690968

Twitter discussions and emotions about COVID-19 pandemic: a machine learning approach

  • Jia Xue; 
  • Junxiang Chen; 
  • Ran Hu; 
  • Chen Chen; 
  • Chengda Zheng; 
  • Yue Su; 
  • Tingshao Zhu

ABSTRACT

Background:

Public response to the COVID-19 pandemic is important to be measured. Twitter data are an important source for the infodemiology study of public response monitoring.

Objective:

The objective of the study is to examine coronavirus disease (COVID-19) related discussions, concerns, and sentiments that emerged from tweets posted by Twitter users.

Methods:

We analyze 4 million Twitter messages related to the COVID-19 pandemic using a list of 20 hashtags such as "coronavirus," "COVID-19," "quarantine" from March 7 to April 21 in 2020. We use a machine learning approach, Latent Dirichlet Allocation (LDA), to identify popular unigram, bigrams, salient topics and themes, and sentiments in the collected tweets.

Results:

Popular unigrams include "virus," "lockdown," and "quarantine." Popular bigrams include "COVID-19," "stay home," "corona virus," "social distancing," and "new cases." We identify 13 discussion topics and categorize them into 5 different themes, such as "public health measures to slow the spread of COVID-19," “social stigma associated with COVID-19,” “coronavirus news cases and deaths,” “COVID-19 in the United States,” and “coronavirus cases in the rest of the world.” Across all identified topics, the dominant sentiments for the spread of coronavirus are anticipation that measures that can be taken, followed by a mixed feeling of trust, anger, and fear for different topics. The public reveals a significant feeling of fear when discussing the coronavirus new cases and deaths than other topics.

Conclusions:

The study shows that Twitter data and machine learning approaches can be leveraged for infodemiology study by studying the evolving public discussions and sentiments during the COVID-19. As the situation evolves rapidly, several topics are consistently dominant on Twitter, such as “the confirmed cases and death rates,” “preventive measures,” “health authorities and government policies,” “COVID-19 stigma,” and “negative psychological reactions (e.g., fear).” Real-time monitoring and assessment of the Twitter discussions and concerns can be promising for public health emergency responses and planning. Already emerged pandemic fear, stigma, and mental health concerns may continue to influence public trust when there occurs a second wave of COVID-19 or a new surge of the imminent pandemic.


 Citation

Please cite as:

Xue J, Chen J, Hu R, Chen C, Zheng C, Su Y, Zhu T

Twitter Discussions and Emotions About the COVID-19 Pandemic: Machine Learning Approach

J Med Internet Res 2020;22(11):e20550

DOI: 10.2196/20550

PMID: 33119535

PMCID: 7690968

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.