Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: JMIR Formative Research

Date Submitted: Aug 24, 2022
Date Accepted: Nov 17, 2022
Date Submitted to PubMed: Dec 22, 2022

The final, peer-reviewed published version of this preprint can be found here:

Estimating County-Level Overdose Rates Using Opioid-Related Twitter Data: Interdisciplinary Infodemiology Study

Cuomo R, Purushothaman V, Calac A, McMann T, Li Z, Mackey T

Estimating County-Level Overdose Rates Using Opioid-Related Twitter Data: Interdisciplinary Infodemiology Study

JMIR Form Res 2023;7:e42162

DOI: 10.2196/42162

PMID: 36548118

PMCID: 9909516

Estimating County-Level Overdose Rates using Opioid-Related Twitter Data: Interdisciplinary Infodemiology Study

  • Raphael Cuomo; 
  • Vidya Purushothaman; 
  • Alec Calac; 
  • Tiana McMann; 
  • Zhuoran Li; 
  • Tim Mackey

ABSTRACT

Background:

There was an estimated 100,000 drug overdose deaths between April 2020 and April 2021, a three-quarters increase from the prior 12-month period. There is an approximate six-month reporting lag for provisional counts of drug overdose deaths from the National Vital Statistics System, and the highest level of geospatial resolution is at the state level. By contrast, public social media data are available close to real-time and are often accessible with precise coordinates.

Objective:

We sought to assess whether small area overdose mortality burden could be estimated using opioid-related social media data.

Methods:

ICD codes for poisoning/exposure to overdose at the county level were obtained from CDC Wonder. Demographics were collected from the American Community Survey. The Twitter API was used to obtain tweets which contained any of 36 terms with drug names. An unsupervised classification approach was used for clustering tweets. Population-normalized variables and polynomial population-normalized variables were produced. Furthermore, z-scores of the Getis Ord Gi clustering statistic were produced, and both these scores and their polynomial counterparts were explored in regression modeling of small area overdose mortality burden.

Results:

Modeling of overdose mortality with normalized demographic variables alone explained only 7.4% of the variability in county-level overdose mortality, whereas this was approximately doubled by the use of specific demographic and social media covariates based on a backwards selection approach.

Conclusions:

Social media data, when transformed using certain statistical approaches, may add utility in the goal of producing closer to real-time small area estimates of overdose mortality.


 Citation

Please cite as:

Cuomo R, Purushothaman V, Calac A, McMann T, Li Z, Mackey T

Estimating County-Level Overdose Rates Using Opioid-Related Twitter Data: Interdisciplinary Infodemiology Study

JMIR Form Res 2023;7:e42162

DOI: 10.2196/42162

PMID: 36548118

PMCID: 9909516

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.