Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR Cancer

Date Submitted: Nov 18, 2020
Date Accepted: Jun 13, 2021

The final, peer-reviewed published version of this preprint can be found here:

Analysis of Population Differences in Digital Conversations About Cancer Clinical Trials: Advanced Data Mining and Extraction Study

Perez EA, Jaffee EM, Whyte J, Boyce CA, Carpten JD, Lozano G, Williams RM, Winkfield KM, Bernstein D, Poblete S

Analysis of Population Differences in Digital Conversations About Cancer Clinical Trials: Advanced Data Mining and Extraction Study

JMIR Cancer 2021;7(3):e25621

DOI: 10.2196/25621

PMID: 34554099

PMCID: 8498899

Analysis of Population Differences in Digital Conversations about Cancer Clinical Trials: Advanced Data Mining and Extraction Study

  • Edith A. Perez; 
  • Elizabeth M. Jaffee; 
  • John Whyte; 
  • Cheryl A. Boyce; 
  • John D. Carpten; 
  • Guillermina Lozano; 
  • Raymond M. Williams; 
  • Karen M. Winkfield; 
  • David Bernstein; 
  • Sung Poblete

ABSTRACT

Background:

Racial and ethnic diversity in clinical trials for cancer treatment is essential for the development of treatments that are effective for all patients and to identify potential differences in toxicity between different demographics. Mining of social media discussions about clinical trials has been used previously to identify patient barriers of enrollment into clinical trials, however a comprehensive breakdown of sentiments and barriers by various racial and ethnic groups is lacking.

Objective:

The aim of this study was to use an innovative data mining toolset to analyze online conversations about cancer clinical trials and identify and compare conversation topics, barriers and sentiments between different racial and ethnic populations.

Methods:

We analyzed a total of 372,283 online conversations about cancer clinical trials. 179,339 of the discussions (48%) had identifiable race information. Using sophisticated machine learning software and analyses, we were able to identify key sentiments and feelings, topics of interest and barriers to clinical trials across racial groups. Stage of treatment could also be identified in many of the discussions, allowing for unique insight into how the sentiments and challenges of patients change throughout the treatment process for each racial group.

Results:

We observed that only 4% of cancer-related discussions referenced clinical trials. Within these discussions, the most frequently referenced topics of healthcare professional (HCP) interactions, cost of care, fear, anxiety and lack of awareness affected each racial and ethnic group differently throughout the treatment process.

Conclusions:

This information is valuable to identify the ideal content and timing for delivery of clinical trial information and resources for different racial and ethnic groups. It also demonstrates the power of social media data mining in healthcare research.


 Citation

Please cite as:

Perez EA, Jaffee EM, Whyte J, Boyce CA, Carpten JD, Lozano G, Williams RM, Winkfield KM, Bernstein D, Poblete S

Analysis of Population Differences in Digital Conversations About Cancer Clinical Trials: Advanced Data Mining and Extraction Study

JMIR Cancer 2021;7(3):e25621

DOI: 10.2196/25621

PMID: 34554099

PMCID: 8498899

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.