Accepted for/Published in: JMIR Medical Informatics
Date Submitted: Apr 13, 2021
Open Peer Review Period: Apr 13, 2021 - Jun 8, 2021
Date Accepted: Jul 19, 2021
(closed for review but you can still tweet)
Categorising Vaccine Confidence with Transformer-Based Machine Learning Model: The Nuances of Vaccine Sentiment within Twitter Discourse
ABSTRACT
Background:
With growing conversations online and less than desired maternal vaccination uptake rates, these conversations could provide useful insight to inform future interventions. Automated processes for this type of analysis, such as natural language processing (NLP), have faced challenges extracting complex stances, like attitudes toward vaccines, from large text.
Objective:
In this study, we aimed to build upon recent advances in Transformer-based machine learning methods, and test if this could be used as a tool to assess the stance of social media posts towards vaccination during pregnancy.
Methods:
A total of 16,604 Tweets posted between 1 November 2018 and 30 April 2019 were selected by boolean searches related to maternal vaccination. Tweets were coded by three individual researchers into the categories “Promotional”, “Discouraging”, “Ambiguous” and “Neutral” After creating a final dataset of 2,722 unique tweets, multiple machine learning methods were trained on the dataset and then tested and compared to the human annotators.
Results:
We received an accuracy of 81.8% (F-score= 0.78) compared to the agreed score between the three annotators. For comparison, the accuracies of the individual annotators compared to the final score were 83.3%, 77.9% and 77.5%.
Conclusions:
This study demonstrates the ability to achieve close to the same accuracy in categorising tweets using our machine learning models as could be expected by a single human annotator. The potential to use this reliable and accurate automated process could free up valuable time and resource constraints of conducting this analysis, in addition to inform potentially effective and necessary interventions. Clinical Trial: N/A
Citation
Request queued. Please wait while the file is being generated. It may take some time.
Copyright
© The authors. All rights reserved. This is a privileged document currently under peer-review/community review (or an accepted/rejected manuscript). Authors have provided JMIR Publications with an exclusive license to publish this preprint on it's website for review and ahead-of-print citation purposes only. While the final peer-reviewed paper may be licensed under a cc-by license on publication, at this stage authors and publisher expressively prohibit redistribution of this draft paper other than for review purposes.