Detecting the Presence of COVID-19 Vaccination Hesitancy from South African Twitter Data Using Machine Learning
Nicholas Perikli, Srimoy Bhattacharya, Blessing Ogbuokiri, Zahra, Movahedi Nia, Benjamin Lieberman, Nidhi Tripathi, Salah-Eddine Dahbi, Finn, Stevenson, Nicola Bragazzi, Jude Kong, Bruce Mellado

TL;DR
This study analyzes South African Twitter data to detect COVID-19 vaccine hesitancy using machine learning, finding that transformer-based models like BERT and RoBERTa outperform traditional methods in sentiment classification.
Contribution
It introduces a novel approach combining sentiment analysis with transformer models on South African social media data to identify vaccine hesitancy.
Findings
BERT and RoBERTa achieved F1-scores of 60% and 61%.
Transformer models outperform traditional ML models in this context.
Topic modeling on misclassified tweets provides insights for future improvements.
Abstract
Very few social media studies have been done on South African user-generated content during the COVID-19 pandemic and even fewer using hand-labelling over automated methods. Vaccination is a major tool in the fight against the pandemic, but vaccine hesitancy jeopardizes any public health effort. In this study, sentiment analysis on South African tweets related to vaccine hesitancy was performed, with the aim of training AI-mediated classification models and assessing their reliability in categorizing UGC. A dataset of 30000 tweets from South Africa were extracted and hand-labelled into one of three sentiment classes: positive, negative, neutral. The machine learning models used were LSTM, bi-LSTM, SVM, BERT-base-cased and the RoBERTa-base models, whereby their hyperparameters were carefully chosen and tuned using the WandB platform. We used two different approaches when we pre-processed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVaccine Coverage and Hesitancy · Misinformation and Its Impacts · Hate Speech and Cyberbullying Detection
MethodsAttention Is All You Need · Refunds@Expedia|||How do I get a full refund from Expedia? · Layer Normalization · Linear Layer · Dropout · Sigmoid Activation · WordPiece · Adam · Tanh Activation · Attention Dropout
