FaithDial: A Faithful Benchmark for Information-Seeking Dialogue
Nouha Dziri, Ehsan Kamalloo, Sivan Milton, Osmar Zaiane, Mo Yu,, Edoardo M. Ponti, Siva Reddy

TL;DR
FaithDial is a new benchmark dataset designed to reduce hallucinations in information-seeking dialogues, improving faithfulness and engagement in dialogue systems through data editing and auxiliary training objectives.
Contribution
The paper introduces FaithDial, a benchmark created by editing hallucinated responses, and demonstrates its effectiveness in training models with improved faithfulness and dialogue quality.
Findings
FaithDial reduces hallucinations compared to WoW.
Models trained on FaithDial outperform others in faithfulness and engagement.
FaithDial benefits zero-shot transfer to other datasets.
Abstract
The goal of information-seeking dialogue is to respond to seeker queries with natural language utterances that are grounded on knowledge sources. However, dialogue systems often produce unsupported utterances, a phenomenon known as hallucination. To mitigate this behavior, we adopt a data-centric solution and create FaithDial, a new benchmark for hallucination-free dialogues, by editing hallucinated responses in the Wizard of Wikipedia (WoW) benchmark. We observe that FaithDial is more faithful than WoW while also maintaining engaging conversations. We show that FaithDial can serve as training signal for: i) a hallucination critic, which discriminates whether an utterance is faithful or not, and boosts the performance by 12.8 F1 score on the BEGIN benchmark compared to existing datasets for dialogue coherence; ii) high-quality dialogue generation. We benchmark a series of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
MethodsWizard: Unsupervised goats tracking algorithm
