JMIR Preprints #29584: Categorising Vaccine Confidence with Transformer-Based Machine Learning Model: The Nuances of Vaccine Sentiment on Twitter

Current Preprint Settings

(as selected by the authors)

1. When the manuscript is submitted, allow peer review from:

(a) Anybody (open community peer review)
(b) Editor-selected reviewers (closed peer review)

2. When the manuscript is submitted, display the preprint PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

3. When the manuscript is accepted, display the accepted manuscript PDF to:

(a) Anybody, anytime
(b) Logged-in users only
(c) Anybody, anytime (title and abstract only)
(d) No one

Categorising Vaccine Confidence with Transformer-Based Machine Learning Model: The Nuances of Vaccine Sentiment on Twitter

Per Kummervold;
Sam Martin;
Sara Dada;
Eliz Kilich;
Chermain Denny;
Pauline Paterson;
Heidi J Larson

ABSTRACT

Background:

With growing conversations online and less than desired maternal vaccination uptake rates, these conversations could provide useful insight to inform future interventions. Automated processes for this type of analysis, such as natural language processing (NLP), have faced challenges extracting complex stances, like attitudes toward vaccines, from large text.

Objective:

In this study, we aimed to build upon recent advances in Transformer-based machine learning methods, and test if this could be used as a tool to assess the stance of social media posts towards vaccination during pregnancy.

Methods:

A total of 16,604 Tweets posted between 1 November 2018 and 30 April 2019 were selected by boolean searches related to maternal vaccination. Tweets were coded by three individual researchers into the categories “Promotional”, “Discouraging”, “Ambiguous” and “Neutral” After creating a final dataset of 2,722 unique tweets, multiple machine learning methods were trained on the dataset and then tested and compared to the human annotators.

Results:

We received an accuracy of 81.8% (F-score= 0.78) compared to the agreed score between the three annotators. For comparison, the accuracies of the individual annotators compared to the final score were 83.3%, 77.9% and 77.5%.

Conclusions:

This study demonstrates the ability to achieve close to the same accuracy in categorising tweets using our machine learning models as could be expected by a single human annotator. The potential to use this reliable and accurate automated process could free up valuable time and resource constraints of conducting this analysis, in addition to inform potentially effective and necessary interventions. Clinical Trial: N/A

Citation

Please cite as:

Kummervold P, Martin S, Dada S, Kilich E, Denny C, Paterson P, Larson HJ

Categorizing Vaccine Confidence With a Transformer-Based Machine Learning Model: Analysis of Nuances of Vaccine Sentiment in Twitter Discourse

JMIR Med Inform 2021;9(10):e29584

DOI: 10.2196/29584

PMID: 34623312

PMCID: 8538052

Download PDF

Request queued. Please wait while the file is being generated. It may take some time.

Copyright

© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.

JMIR Publications

JMIR Preprints

Accepted for/Published in: JMIR Medical Informatics

Date Submitted: Apr 13, 2021

Open Peer Review Period: Apr 13, 2021 - Jun 8, 2021

Date Accepted: Jul 19, 2021

(closed for review but you can still tweet)

Categorising Vaccine Confidence with Transformer-Based Machine Learning Model: The Nuances of Vaccine Sentiment on Twitter

ABSTRACT

Citation

Copyright