Fine-tuned Sentiment Analysis of COVID-19 Vaccine-Related Social Media Data: Comparative Study
Chad A Melton, Brianna M White, Robert L Davis, Robert A Bednarczyk,, Arash Shaban-Nejad

TL;DR
This study compares public sentiment on COVID-19 vaccines expressed on Reddit and Twitter using a fine-tuned DistilRoBERTa model trained on millions of social media comments, revealing platform-specific sentiment trends during the pandemic.
Contribution
The paper introduces a fine-tuned DistilRoBERTa model for large-scale sentiment analysis of COVID-19 vaccine-related social media data, with dataset augmentation and cross-platform comparison.
Findings
Twitter sentiment was more negative (52% positive)
Reddit sentiment was more positive (53% positive)
Both platforms showed similar sentiment trends during key vaccine events
Abstract
This study investigated and compared public sentiment related to COVID-19 vaccines expressed on two popular social media platforms, Reddit and Twitter, harvested from January 1, 2020, to March 1, 2022. To accomplish this task, we created a fine-tuned DistilRoBERTa model to predict sentiments of approximately 9.5 million Tweets and 70 thousand Reddit comments. To fine-tune our model, our team manually labeled the sentiment of 3600 Tweets and then augmented our dataset by the method of back-translation. Text sentiment for each social media platform was then classified with our fine-tuned model using Python and the Huggingface sentiment analysis pipeline. Our results determined that the average sentiment expressed on Twitter was more negative (52% positive) than positive and the sentiment expressed on Reddit was more positive than negative (53% positive). Though average sentiment was found…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
