TL;DR
VitaminC is a new benchmark dataset for fact verification that emphasizes subtle evidence differences, enhancing model robustness against changing information and supporting multiple related tasks.
Contribution
The paper introduces VitaminC, a challenging fact verification benchmark with contrastive evidence pairs, and demonstrates its effectiveness in improving model robustness and enabling auxiliary fact-checking tasks.
Findings
Training on VitaminC improves adversarial verification accuracy by 10%.
The dataset enhances robustness in natural language inference tasks.
Additional tasks like word tagging and revision detection are enabled by VitaminC.
Abstract
Typical fact verification models use retrieved written evidence to verify claims. Evidence sources, however, often change over time as more information is gathered and revised. In order to adapt, models must be sensitive to subtle differences in supporting evidence. We present VitaminC, a benchmark infused with challenging cases that require fact verification models to discern and adjust to slight factual changes. We collect over 100,000 Wikipedia revisions that modify an underlying fact, and leverage these revisions, together with additional synthetically constructed ones, to create a total of over 400,000 claim-evidence pairs. Unlike previous resources, the examples in VitaminC are contrastive, i.e., they contain evidence pairs that are nearly identical in language and content, with the exception that one supports a given claim while the other does not. We show that training using…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
