VaxxHesitancy: A Dataset for Studying Hesitancy towards COVID-19 Vaccination on Twitter
Yida Mu, Mali Jin, Charlie Grimshaw, Carolina Scarton, Kalina Bontcheva, Xingyi Song

TL;DR
This paper introduces a new dataset of over 3,100 tweets annotated for COVID-19 vaccine hesitancy and develops VaxxBERT, a domain-specific language model that outperforms baselines in predicting vaccine attitudes.
Contribution
The paper presents the first dataset and model that distinguish vaccine hesitancy as a separate category from pro- and anti-vaccine stances.
Findings
VaxxBERT achieves 73.0% accuracy and an F1-score of 69.3.
The dataset enables nuanced analysis of vaccine attitudes.
Vaccine hesitancy can be effectively modeled as a distinct category.
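For readers unfamiliar with the two metrics quoted in the findings, the following is a minimal sketch of how accuracy and an F1-score can be computed for a stance classifier with several classes. It is not the authors' evaluation code. The label names, the toy predictions, and the choice of macro-averaging (an unweighted mean of per-class F1) are assumptions made for illustration; the summary above does not say which averaging the paper uses.

```python
# Illustrative only: label names and example data are hypothetical,
# and macro-averaging is an assumption, not confirmed by the source.
LABELS = ["pro", "anti", "hesitant"]


def accuracy(gold, pred):
    """Fraction of examples whose predicted label equals the gold label."""
    if len(gold) != len(pred) or not gold:
        raise ValueError("gold and pred must be non-empty and the same length")
    return sum(g == p for g, p in zip(gold, pred)) / len(gold)


def macro_f1(gold, pred, labels=LABELS):
    """Unweighted mean of per-class F1 scores."""
    scores = []
    for label in labels:
        tp = sum(g == label and p == label for g, p in zip(gold, pred))
        fp = sum(g != label and p == label for g, p in zip(gold, pred))
        fn = sum(g == label and p != label for g, p in zip(gold, pred))
        denom = 2 * tp + fp + fn
        # F1 = 2*TP / (2*TP + FP + FN); defined as 0 when the class never appears.
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Toy gold labels and predictions for six hypothetical tweets.
gold = ["pro", "anti", "hesitant", "hesitant", "pro", "anti"]
pred = ["pro", "anti", "pro", "hesitant", "pro", "hesitant"]

print(f"accuracy = {accuracy(gold, pred):.3f}")
print(f"macro F1 = {macro_f1(gold, pred):.3f}")
```

Macro-averaging gives each class equal weight, so a smaller class such as "hesitant" influences the score as much as the larger ones. That is one reason an F1-score can sit below accuracy on an imbalanced dataset.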
Abstract
Vaccine hesitancy has been a common concern, probably since vaccines were created and, with the popularisation of social media, people started to express their concerns about vaccines online alongside those posting pro- and anti-vaccine content. Predictably, since the first mentions of a COVID-19 vaccine, social media users posted about their fears and concerns or about their support and belief in the effectiveness of these rapidly developing vaccines. Identifying and understanding the reasons behind public hesitancy towards COVID-19 vaccines is important for policy makers that need to develop actions to better inform the population with the aim of increasing vaccine take-up. In the case of COVID-19, where the fast development of the vaccines was mirrored closely by growth in anti-vaxx disinformation, automatic means of detecting citizen attitudes towards vaccination became…
Taxonomy
Topics: Vaccine Coverage and Hesitancy · Misinformation and Its Impacts · Hate Speech and Cyberbullying Detection
