Who Shapes Brazil's Vaccine Debate? Semi-Supervised Modeling of Stance and Polarization in YouTube's Media Ecosystem
Geovana S. de Oliveira, Ana P. C. Silva, Fabricio Murai, Carlos H. G. Ferreira

TL;DR
This study analyzes Brazil's vaccine discourse on YouTube over time, using semi-supervised learning to classify comments and reveal polarization patterns, especially during COVID-19, with implications for health communication strategies.
Contribution
It introduces a large-scale, longitudinal semi-supervised stance detection framework applied to Brazil's YouTube vaccine comments, improving classification robustness and understanding polarization dynamics.
Findings
Polarization spikes during epidemiological crises like COVID-19.
Science and digital-native channels are key sites of vaccine discourse.
Semi-supervised learning enhances stance classification accuracy.
Abstract
Vaccination remains a cornerstone of global public health, yet the COVID-19 pandemic exposed how online misinformation, political polarization, and declining institutional trust can undermine immunization efforts. Most of the prior computational studies that analyzed vaccine discourse on social platforms focus on English-language data, specific vaccines, or short time windows, impairing our understanding of long-term dynamics in high-impact, non-English contexts like Brazil, home to one of the world's most comprehensive immunization systems. We here present the largest longitudinal study of Brazil's vaccine discourse on YouTube, leveraging a semi-supervised stance detection framework that combines self-labeling and self-training to classify nearly 1.4 million comments. By integrating stance with temporal patterns, engagement metrics, and channel taxonomy (legacy media, science…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
