WhatsApp Vaccine Discourse (WhaVax): An Expert-Annotated Dataset and Benchmark for Health Misinformation Detection

J\^onatas H. dos Santos; Julio C. S. Reis; Philipe Melo; Jo\~ao F. H. Olivetti; Thales H. Silva; Matheus Gontijo Guimaraes; Glaucio de Souza; Marcos A. Gon\c{c}alves; Fabricio Benevenuto; Filipe B. B. Zanovello; Marco A. G. Rodrigues; Cristiano X. Lima

arXiv:2605.12510·cs.SI·May 14, 2026

WhatsApp Vaccine Discourse (WhaVax): An Expert-Annotated Dataset and Benchmark for Health Misinformation Detection

J\^onatas H. dos Santos, Julio C. S. Reis, Philipe Melo, Jo\~ao F. H. Olivetti, Thales H. Silva, Matheus Gontijo Guimaraes, Glaucio de Souza, Marcos A. Gon\c{c}alves, Fabricio Benevenuto, Filipe B. B. Zanovello, Marco A. G. Rodrigues, Cristiano X. Lima

PDF

TL;DR

This paper introduces WhaVax, a high-quality expert-annotated dataset of vaccine-related WhatsApp messages from Brazil, and benchmarks various models for health misinformation detection in private messaging.

Contribution

It presents a carefully constructed, expert-annotated dataset and a comprehensive analysis of misinformation patterns, along with benchmarking of multiple modeling approaches.

Findings

01

Large language models perform competitively in misinformation detection.

02

Domain alignment and data availability are critical for model performance.

03

The dataset exhibits complex linguistic and structural misinformation patterns.

Abstract

We introduce WhaVax, a new expert-annotated dataset of vaccine-related WhatsApp messages collected from large Brazilian public groups spanning multiple pandemic years. The dataset was constructed through a rigorous, carefully designed pipeline that integrates keyword-based data collection, semantic deduplication to remove near-duplicate content, and a multi-stage annotation protocol conducted by medical specialists. This process produced a high-quality gold-standard corpus, characterized by substantial inter-annotator agreement and strong reliability for downstream analysis. Additionally, we provide a detailed characterization of WhatsApp misinformation, revealing distinctive linguistic, structural, lexical, temporal, and group-level patterns, as well as a meaningful layer of ambiguous cases that reflect the complexity of health discourse in private messaging. We also benchmark…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.