A Labelled Dataset for Sentiment Analysis of Videos on YouTube, TikTok, and Other Sources about the 2024 Outbreak of Measles
Nirmalya Thakur, Vanessa Su, Mingchen Shao, Kesha A. Patel, Hongseok Jeong, Victoria Knieling, and Andrew Bian

TL;DR
This paper introduces a comprehensive dataset of 4011 videos related to the 2024 measles outbreak from various social media platforms, along with sentiment and subjectivity annotations, to facilitate research in video sentiment analysis.
Contribution
It provides a novel, annotated dataset of videos about the measles outbreak, including sentiment and subjectivity labels, enabling advanced machine learning research in social media video analysis.
Findings
Sentiment analysis classified videos into positive, negative, or neutral.
Subjectivity analysis categorized videos as opinionated or neutral.
Fine-grain sentiment analysis identified emotions like fear, joy, or sadness.
Abstract
This paper presents a dataset of 4011 videos about the ongoing outbreak of measles, published on 264 websites between January 1, 2024, and May 31, 2024. The dataset is available at https://dx.doi.org/10.21227/40s8-xf63. These websites primarily include YouTube and TikTok, which account for 48.6% and 15.2% of the videos, respectively. The remaining websites include Instagram and Facebook as well as the websites of various global and local news organizations. For each video, the dataset presents the URL of the video, the title of the post, the description of the post, and the date of publication as separate attributes. After developing this dataset, sentiment analysis (using VADER), subjectivity analysis (using TextBlob), and fine-grain sentiment analysis (using DistilRoBERTa-base) of the video titles and…
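The labelling steps named in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' pipeline: the function names are hypothetical, the ±0.05 compound-score threshold is the convention commonly used with VADER, and the 0.5 subjectivity cutoff is an assumption, since this excerpt does not state the thresholds the paper used. The fine-grain (emotion) step is omitted because the excerpt does not identify the exact DistilRoBERTa-base checkpoint.

```python
# Sketch only: names, thresholds, and the cutoff are illustrative assumptions,
# not values taken from the paper.


def vader_compound(text: str) -> float:
    """Return VADER's compound polarity score in [-1, 1].

    Requires the third-party `vaderSentiment` package; imported lazily so the
    labelling helpers below work without it.
    """
    from vaderSentiment.vaderSentiment import SentimentIntensityAnalyzer

    return SentimentIntensityAnalyzer().polarity_scores(text)["compound"]


def textblob_subjectivity(text: str) -> float:
    """Return TextBlob's subjectivity score in [0, 1].

    Requires the third-party `textblob` package; imported lazily.
    """
    from textblob import TextBlob

    return TextBlob(text).sentiment.subjectivity


def sentiment_label(compound: float, threshold: float = 0.05) -> str:
    """Map a compound score to positive / negative / neutral.

    The +/-0.05 threshold is the commonly used VADER convention (assumed here).
    """
    if compound >= threshold:
        return "positive"
    if compound <= -threshold:
        return "negative"
    return "neutral"


def subjectivity_label(subjectivity: float, cutoff: float = 0.5) -> str:
    """Map a subjectivity score to opinionated / neutral (cutoff is assumed)."""
    return "opinionated" if subjectivity > cutoff else "neutral"


if __name__ == "__main__":
    # Hand-picked scores, so the demo runs without either library installed.
    for compound, subjectivity in [(0.62, 0.8), (-0.41, 0.3), (0.0, 0.0)]:
        print(sentiment_label(compound), subjectivity_label(subjectivity))
```

With the two libraries installed, a title or description from the dataset could be labelled by calling `sentiment_label(vader_compound(title))` and `subjectivity_label(textblob_subjectivity(title))`.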
Taxonomy
Topics: Hate Speech and Cyberbullying Detection · Vaccine Coverage and Hesitancy · Social Media in Health Education
