Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: Journal of Medical Internet Research

Date Submitted: Oct 18, 2022
Date Accepted: Aug 18, 2023

The final, peer-reviewed published version of this preprint can be found here:

Automating Detection of Drug-Related Harms on Social Media: Machine Learning Framework

Fisher A, Young M, Payer D, Pacheco K, Dubeau C, Mago V

Automating Detection of Drug-Related Harms on Social Media: Machine Learning Framework

J Med Internet Res 2023;25:e43630

DOI: 10.2196/43630

PMID: 37725410

PMCID: 10548323

Automating Detection of Drug-Related Harms on Social Media: A Machine Learning Framework

  • Andrew Fisher; 
  • Matthew Young; 
  • Doris Payer; 
  • Karen Pacheco; 
  • Chad Dubeau; 
  • Vijay Mago

ABSTRACT

Background:

One of the hallmarks of unregulated drug markets is their unpredictability and constant evolution with newly introduced substances. People who use drugs and the public health workforce are often unaware of the appearance of new drugs on the unregulated market and their type, safe dosage, and potential adverse effects. This increases risks to people who use drugs, including the risk of unknown consumption and unintentional drug poisoning. Early Warning Systems can help monitor the landscape of emerging drugs in a given community by collecting and tracking up-to-date information and determining trends. However, there are currently few ways to systematically monitor the appearance and harms of new drugs on the unregulated market in Canada.

Objective:

The goal of this work is to examine how artificial intelligence can assist in identifying patterns of drug-related risks and harms, specifically by monitoring the social media activity of public health and law enforcement groups. This information is beneficial in the form of an Early Warning System as it can be used to identify new and emerging drug trends in various communities.

Methods:

To build a dataset for this study, 145 relevant Twitter accounts throughout Quebec (33), Ontario (78), and British Columbia (34) were manually identified. Tweets posted between August 23 and December 21, 2021 were collected via the Twitter API for a total of 40,393 tweets. Next, subject matter experts 1) developed a keyword filter that reduced the dataset to 3,746 tweets and 2) manually identified which tweets were relevant to monitoring and early warning efforts for a total of 464 tweets. Using this information, a zero-shot classifier was applied to tweets from step 1 with a set of keep (drug arrest, drug discovery, drug report) and not keep (drug addiction support, public safety report, other) labels to see how accurately it could extract the tweets identified in step 2.

Results:

When looking at the accuracy in identifying relevant posts, the system extracted a total of 523 tweets and had an overlap of 397/477 (specificity of ~83.2%) with the subject matter experts. Conversely, the system identified a total of 3,184 irrelevant tweets and had an overlap of 3,104/3,230 (sensitivity of ~96.1%) with the subject matter experts.

Conclusions:

This study demonstrates the benefits of using artificial intelligence to assist in finding relevant tweets for an Early Warning System. The results showed that it can be quite accurate in filtering out irrelevant information which greatly reduces the amount of manual work required. Although the accuracy in retaining relevant information was observed to be lower, an analysis showed that the label definitions can impact the results significantly and would therefore be suitable for future work to refine. Nonetheless, the performance is promising and demonstrates the usefulness of artificial intelligence in this domain.


 Citation

Please cite as:

Fisher A, Young M, Payer D, Pacheco K, Dubeau C, Mago V

Automating Detection of Drug-Related Harms on Social Media: Machine Learning Framework

J Med Internet Res 2023;25:e43630

DOI: 10.2196/43630

PMID: 37725410

PMCID: 10548323

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.