Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: Journal of Medical Internet Research

Date Submitted: Dec 13, 2022
Date Accepted: Aug 15, 2023

The final, peer-reviewed published version of this preprint can be found here:

Hot Topic Recognition of Health Rumors Based on Anti-Rumor Articles on the WeChat Official Account Platform: Topic Modeling

Li Z, Wu X, Xu L, Liu M, Huang C

Hot Topic Recognition of Health Rumors Based on Anti-Rumor Articles on the WeChat Official Account Platform: Topic Modeling

J Med Internet Res 2023;25:e45019

DOI: 10.2196/45019

PMID: 37733396

PMCID: 10557010

Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.

Topic Modeling of Health Rumors Based on Anti-Rumor Tweets on the WeChat Platform: Machine Learning Analysis

  • Ziyu Li; 
  • Xiaoqian Wu; 
  • Lin Xu; 
  • Ming Liu; 
  • Cheng Huang

ABSTRACT

Background:

Social network has become one of the main channels for the public to obtain health information. However, it has also become a source for the spread of health-related misinformation. Health-related misinformation seriously threatens the public’s physical and mental health. Topic identification is the premise of health-related misinformation governance.

Objective:

This study aims to explore the types of health-related misinformation favored by WeChat public platform users and their prevalence trends.

Methods:

We used a web crawler tool to capture health rumor-dispelling tweets collected on the rumor-dispelling public account. We collected text information from health-debunking tweets posted between January 1, 2016, and August 31, 2022. Following word segmentation of the collected text, a document topic generation model, Latent Dirichlet Assignment, is used to identify and generalize the most common topics. The proportion distribution of themes was calculated, and the negative impact of various health rumors in different periods was subsequently analyzed. Additionally, the prevalence of health rumors was analyzed using the number of health rumors generated at each time point.

Results:

From January 1, 2016 to August 31, 2022, we collected 9,366 rumor-refuting tweets from WeChat official accounts. Through topic modeling, we divided the health rumors into eight topics, including the prevention and treatment of infectious diseases (n = 1,284, 13.71%), disease therapy and its effects (n = 1,037, 11.07%), food safety (n = 1,243, 13.27%), cancer and its causes (n = 946, 10.10%), regimen and disease (n = 1,540, 16.44%), rumors of transmission (n = 914, 9.76%), healthy diet (n = 1,068, 11.40%), and nutrition and health (n = 1,334, 14.24%). Furthermore, we summarized the eight topics into four themes, including public health, disease, diet and health, and rumor spreading.

Conclusions:

Our study shows that the topic model can provide analysis and insights into health rumor governance. The analysis of rumor development trends shows that public health, disease, and diet and health problems are the most affected areas of rumors. Governments still need to consider national conditions, formulate appropriate policies, and deal with health rumors more comprehensively. While ensuring the health of the Internet, we should also improve the level of national quality education. We recommend that additional sentiment analysis-related studies be conducted to verify the impact of health rumor-related topics.


 Citation

Please cite as:

Li Z, Wu X, Xu L, Liu M, Huang C

Hot Topic Recognition of Health Rumors Based on Anti-Rumor Articles on the WeChat Official Account Platform: Topic Modeling

J Med Internet Res 2023;25:e45019

DOI: 10.2196/45019

PMID: 37733396

PMCID: 10557010

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.