Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR Medical Informatics

Date Submitted: Apr 19, 2021
Date Accepted: Sep 23, 2021
Date Submitted to PubMed: Nov 30, 2021

The final, peer-reviewed published version of this preprint can be found here:

A Pipeline to Understand Emerging Illness Via Social Media Data Analysis: Case Study on Breast Implant Illness

Dey V, Krasniak P, Nguyen M, Lee C, Ning X

A Pipeline to Understand Emerging Illness Via Social Media Data Analysis: Case Study on Breast Implant Illness

JMIR Med Inform 2021;9(11):e29768

DOI: 10.2196/29768

PMID: 34847064

PMCID: 8669576

A Pipeline to Understand Emerging Illness via Social Media Data Analysis: A Case Study on Breast Implant Illness

  • Vishal Dey; 
  • Peter Krasniak; 
  • Minh Nguyen; 
  • Clara Lee; 
  • Xia Ning

ABSTRACT

Background:

A new illness could first come to the public attention over social media before it is medically defined, formally documented or systematically studied. One example is a phenomenon known as breast implant illness (BII) that has been extensively discussed on social media, though vaguely defined in medical literature.

Objective:

To construct a data analysis pipeline to understand emerging illness using social media data, and to apply the pipeline to understand key attributes of BII.

Methods:

We conducted a pipeline of social media data analysis using Natural Language Processing (NLP) and topic modeling. We extracted mentions related to signs/symptoms, diseases/disorders and medical procedures using the Clinical Text Analysis and Knowledge Extraction System (cTAKES) from social media data. We mapped the mentions to standard medical concepts. We summarized mapped concepts to topics using Latent Dirichlet Allocation (LDA). Finally, we applied this pipeline to understand BII from several BII-dedicated social media sites.

Results:

Our pipeline identified topics related to toxicity, cancer and mental health issues that are highly associated with BII. Our pipeline also shows that cancers, autoimmune disorders and mental health problems are emerging concerns associated with breast implants based on social media discussions. The pipeline also identified mentions such as rupture, infection, pain and fatigue as common self-reported issues among the public, as well as toxicity from silicone implants.

Conclusions:

Our study could inspire future work studying the suggested symptoms and factors of BII. Our study provides the first analysis and derived knowledge of BII from social media using NLP techniques, and demonstrates the potential of using social media information to better understand similar emerging illnesses.


 Citation

Please cite as:

Dey V, Krasniak P, Nguyen M, Lee C, Ning X

A Pipeline to Understand Emerging Illness Via Social Media Data Analysis: Case Study on Breast Implant Illness

JMIR Med Inform 2021;9(11):e29768

DOI: 10.2196/29768

PMID: 34847064

PMCID: 8669576

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.