Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR Cancer

Date Submitted: Jul 28, 2023
Date Accepted: Apr 4, 2024

The final, peer-reviewed published version of this preprint can be found here:

Differing Content and Language Based on Poster-Patient Relationships on the Chinese Social Media Platform Weibo: Text Classification, Sentiment Analysis, and Topic Modeling of Posts on Breast Cancer

Zhang Z, Liew K, Kuijer R, She WJ, Yada S, Wakamiya S, Aramaki E

Differing Content and Language Based on Poster-Patient Relationships on the Chinese Social Media Platform Weibo: Text Classification, Sentiment Analysis, and Topic Modeling of Posts on Breast Cancer

JMIR Cancer 2024;10:e51332

DOI: 10.2196/51332

PMID: 38723250

PMCID: 11117131

Warning: This is an author submission that is not peer-reviewed or edited. Preprints - unless they show as "accepted" - should not be relied on to guide clinical practice or health-related behavior and should not be reported in news media as established information.

Poster-patients relationships show differing content and language on Chinese Weibo: Text classification, sentiment analysis, and topic modeling of posts on breast cancer

  • Zhouqing Zhang; 
  • Kongmeng Liew; 
  • Roeline Kuijer; 
  • Wan-Jou She; 
  • Shuntaro Yada; 
  • Shoko Wakamiya; 
  • Eiji Aramaki

ABSTRACT

Background:

Breast cancer affects the lives of not only those diagnosed, but also the people around them. Many of those affected share their experiences on social media. However, these narratives may differ according to who the poster is and their relationship with the patient; a patient posting about their experiences may post different content from someone who’s friends or family has breast cancer. In China, Weibo is one of the most popular social media platforms, and breast cancer-related posts are frequently found there. We used Weibo as a resource to examine how posts differ according to the different poster-patient relationships

Objective:

With the goal of understanding the different experiences of those affected by breast cancer in China, we aim to explore how content and language used in relevant posts differ according to who the poster is, and their relationship with the patient. Our goal is to examine if there are differences in emotional expression and topic content if the patient is the poster themselves, or a friend, family, relative or acquaintance.

Methods:

We scraped a total of N=10322 relevant Weibo posts. Using a 2-step analysis method, we first fine-tune two Chinese RoBERTa models, on a dataset annotated with poster-patient relationships. These models were lined in sequence, first a binary classifier (‘no_patient’ or ‘patient’), and multiclass classifier (‘post_user’, ‘family_members’, ‘friends_relatives’, ‘acquaintances’, ‘heard_relation’) to classify patient relationships. Next, we used the LIWC lexicon to conduct sentiment analysis on 5 emotion categories (positive and negative emotions, anger, sadness, and anxiety), followed by topic modeling (BERTopic).

Results:

Our binary model (F1 = 0.93) and multiclass model (F1 = 0.83) were largely able to classify patient relationships accurately. Subsequent sentiment analyses showed significant differences in emotion categories across all patient relationships. Notably, negative emotions and anger were higher for the ‘no_patient’ class, but sadness and anxiety were higher for the ‘family_member’ class. Focusing on the top 30 topics, we also noted that topics about fears and rage towards cancer were higher in the ‘no_patient’ class, but topics about cancer treatment were higher in the ‘family_member’ class.

Conclusions:

Chinese users posted different types of content depending on the poster-patient relationship. If the patient was family, posts were sadder and more anxious, but also contained more content on treatments. However, if no patient was detected, posts showed high levels of anger. We think that this may be the poster ranting, which may help with emotion regulation and gathering social support.


 Citation

Please cite as:

Zhang Z, Liew K, Kuijer R, She WJ, Yada S, Wakamiya S, Aramaki E

Differing Content and Language Based on Poster-Patient Relationships on the Chinese Social Media Platform Weibo: Text Classification, Sentiment Analysis, and Topic Modeling of Posts on Breast Cancer

JMIR Cancer 2024;10:e51332

DOI: 10.2196/51332

PMID: 38723250

PMCID: 11117131

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.