JMIR Preprints #30753: A Machine Learning Approach for Characterizing and Identifying the Prevalence of Online Misinformation Relating to Medication for Opioid Use Disorder

Current Preprint Settings

(as selected by the authors)

1. When the manuscript is submitted, allow peer review from:

(a) Anybody (open community peer review)
(b) Editor-selected reviewers (closed peer review)

2. When the manuscript is submitted, display the preprint PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

3. When the manuscript is accepted, display the accepted manuscript PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

A Machine Learning Approach for Characterizing and Identifying the Prevalence of Online Misinformation Relating to Medication for Opioid Use Disorder

Mai ElSherief;
Steven A. Sumner;
Christopher M. Jones;
Royal K. Law;
Akadia Kacha-Ochana;
Lyna Shieber;
LeShaundra Cordier;
Kelly Holton;
Munmun De Choudhury

ABSTRACT

Background:

Expanding access to and use of medication for opioid use disorder (MOUD) is a key component of overdose prevention. One important barrier to uptake of MOUD is exposure to inaccurate and potentially harmful health misinformation in social media or online forums, where individuals commonly seek information. There is a significant need to devise computational techniques to describe the prevalence of online health misinformation related to MOUD to facilitate mitigation efforts.

Objective:

Adopting a multidisciplinary, mixed methods strategy, this paper presents machine learning and natural language analysis approaches to identify the characteristics and prevalence of online misinformation related to MOUD in order to inform future prevention, treatment, and response efforts.

Methods:

The team harnessed English language public social media posts and comments from Twitter (6,365,245 posts), YouTube (99,386 posts), Reddit (13,483,419 posts), and Drugs-Forum.com (5,549 posts). Leveraging public health expert annotations on a sample of 2,400 of these social media posts that were found to be semantically most similar to a variety of prevailing opioid use disorder (OUD)-related myths based on representational learning, the team developed a supervised machine learning classifier. This classifier identified whether a post’s language promoted one of the leading myths challenging addiction treatment: that use of agonist therapy for MOUD is simply replacing one drug with another. Platform-level prevalence was calculated thereafter by machine labeling all unannotated posts with the classifier and noting the proportion of myth-indicative posts over all posts.

Results:

Our results demonstrate promise in identifying social media postings that center around treatment myths about OUD with an accuracy of 91% and an AUC of 0.9, including how these discussions vary across platforms in terms of prevalence and linguistic characteristics, with the lowest prevalence on Online Health Communities (OHCs) such as Reddit and Drugs-Forum and the highest on Twitter. Specifically, the prevalence of the stated MOUD myth ranged from 0.4% on Online Health Communities (OHCs) to 0.9% on Twitter.

Conclusions:

This work provides one of the first large scale assessments of a key MOUD-related myths across multiple social media platforms and highlights the feasibility and importance in ongoing assessment of health misinformation related to addiction treatment.

Citation

Please cite as:

ElSherief M, Sumner SA, Jones CM, Law RK, Kacha-Ochana A, Shieber L, Cordier L, Holton K, De Choudhury M

Characterizing and Identifying the Prevalence of Web-Based Misinformation Relating to Medication for Opioid Use Disorder: Machine Learning Approach

J Med Internet Res 2021;23(12):e30753

DOI: 10.2196/30753

PMID: 34941555

PMCID: 8734931

Download PDF

Request queued. Please wait while the file is being generated. It may take some time.

Copyright

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.

JMIR Publications

JMIR Preprints

Accepted for/Published in: Journal of Medical Internet Research

Date Submitted: May 27, 2021

Open Peer Review Period: May 27, 2021 - Jul 22, 2021

Date Accepted: Oct 19, 2021

Date Submitted to PubMed: Dec 23, 2021

(closed for review but you can still tweet)

A Machine Learning Approach for Characterizing and Identifying the Prevalence of Online Misinformation Relating to Medication for Opioid Use Disorder

ABSTRACT

Citation

Copyright