MAD: A Benchmark for Multi-Turn Audio Dialogue Fact-Checking

Chaewan Chun; Lysandre Terrisse; Delvin Ce Zhang; Dongwon Lee

arXiv:2508.12186·cs.SI·August 19, 2025

MAD: A Benchmark for Multi-Turn Audio Dialogue Fact-Checking

Chaewan Chun, Lysandre Terrisse, Delvin Ce Zhang, Dongwon Lee

PDF

Open Access

TL;DR

MAD introduces a novel multi-turn audio dialogue dataset for fact-checking, capturing complex spoken misinformation dynamics and providing benchmarks that reveal current model limitations in conversational audio verification.

Contribution

This paper presents MAD, the first dataset for multi-turn spoken dialogue fact-checking with detailed annotations, enabling research on conversational and acoustic misinformation detection.

Findings

01

Pretrained models achieve 72-74% accuracy at sentence level.

02

Models reach 71-72% accuracy at dialogue level.

03

MAD exposes significant challenges in multimodal conversational fact-checking.

Abstract

Despite the growing popularity of audio platforms, fact-checking spoken content remains significantly underdeveloped. Misinformation in speech often unfolds across multi-turn dialogues, shaped by speaker interactions, disfluencies, overlapping speech, and emotional tone-factors that complicate both claim detection and verification. Existing datasets fall short by focusing on isolated sentences or text transcripts, without modeling the conversational and acoustic complexity of spoken misinformation. We introduce MAD (Multi-turn Audio Dialogues), the first fact-checking dataset aligned with multi-turn spoken dialogues and corresponding audio. MAD captures how misinformation is introduced, contested, and reinforced through natural conversation. Each dialogue includes annotations for speaker turns, dialogue scenarios, information spread styles, sentence-level check-worthiness, and both…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Speech and dialogue systems · Advanced Text Analysis Techniques