Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Wednesday, July 01, 2020 at 8:00 PM to 10:00 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Accepted for/Published in: Journal of Medical Internet Research

Date Submitted: Aug 28, 2021
Date Accepted: Nov 1, 2021
Date Submitted to PubMed: Nov 29, 2021

The final, peer-reviewed published version of this preprint can be found here:

Predictive Modeling of Vaccination Uptake in US Counties: A Machine Learning–Based Approach

Cheong Q, Au-yeung M, Quon S, Concepcion K, Kong JD

Predictive Modeling of Vaccination Uptake in US Counties: A Machine Learning–Based Approach

J Med Internet Res 2021;23(11):e33231

DOI: 10.2196/33231

PMID: 34751650

PMCID: 8623305

Predictive Modeling of Vaccination Uptake in U.S. Counties: A Machine Learning-based Approach

  • Queena Cheong; 
  • Martin Au-yeung; 
  • Stephanie Quon; 
  • Katsy Concepcion; 
  • Jude Dzevela Kong

ABSTRACT

Background:

While the COVID-19 pandemic has left an unprecedented impact globally, countries such as the United States of America have reported the most significant incidence of COVID-19 cases worldwide. Within the U.S., various sociodemographic factors have played an essential role in the creation of regional disparities. Regional disparities have resulted in the unequal spread of disease between U.S. counties, underscoring the need for efficient and accurate predictive modelling strategies to inform public health officials and reduce the burden on healthcare systems. Furthermore, despite the widespread accessibility of COVID-19 vaccines across the U.S., vaccination rates have become stagnant, necessitating predictive modelling to identify important factors impacting vaccination uptake.

Objective:

To determine the association between sociodemographic factors and vaccine uptake across counties in the U.S.

Methods:

Sociodemographic data on fully vaccinated and unvaccinated individuals were sourced from several online databases, such as the U.S. Centre for Disease Control and U.S. Census Bureau COVID-19 Site. Machine learning analysis was performed using XGBoost and sociodemographic data.

Results:

Our model predicted COVID-19 vaccination uptake across U.S. countries with 59% accuracy. In addition, it identified location, education, ethnicity, and income as the most critical sociodemographic features in predicting vaccination uptake in U.S. counties. Lastly, the model produced a choropleth demonstrating areas of low and high vaccination rates, which can be used by healthcare authorities in future pandemics to visualize and prioritize areas of low vaccination and design targeted vaccination campaigns.

Conclusions:

Our study reveals that sociodemographic characteristics are predictors of vaccine uptake rate across counties in the U.S. and if leveraged appropriately can assist policy makers and public health officials to understand vaccine uptake rates and craft policies to improve them.


 Citation

Please cite as:

Cheong Q, Au-yeung M, Quon S, Concepcion K, Kong JD

Predictive Modeling of Vaccination Uptake in US Counties: A Machine Learning–Based Approach

J Med Internet Res 2021;23(11):e33231

DOI: 10.2196/33231

PMID: 34751650

PMCID: 8623305

Download PDF


Request queued. Please wait while the file is being generated. It may take some time.

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.