Synthetic Disinformation Attacks on Automated Fact Verification Systems

Yibing Du; Antoine Bosselut; Christopher D. Manning

arXiv:2202.09381 · cs.CL · February 22, 2022

TL;DR

This paper examines how vulnerable automated fact-checking systems are to synthetic disinformation. Their performance drops significantly when the evidence sources they rely on contain fabricated or altered documents, a threat that grows as NLP text generators improve.

Contribution

The paper introduces two adversarial attack settings, AdversarialAddition and AdversarialModification, and demonstrates their effectiveness across multiple models and benchmarks, underscoring the need for more robust verification methods.

Findings

01 Fact-checkers' performance drops significantly under synthetic evidence attacks.

02 Both fabricated and modified evidence sources can deceive automated fact-checkers.

03 Modern NLG systems pose a growing threat as generators of disinformation.

Abstract

Automated fact-checking is a needed technology to curtail the spread of online misinformation. One current framework for such solutions proposes to verify claims by retrieving supporting or refuting evidence from related textual sources. However, the realistic use cases for fact-checkers will require verifying claims against evidence sources that could be affected by the same misinformation. Furthermore, the development of modern NLP tools that can produce coherent, fabricated content would allow malicious actors to systematically generate adversarial disinformation for fact-checkers. In this work, we explore the sensitivity of automated fact-checkers to synthetic adversarial evidence in two simulated settings: AdversarialAddition, where we fabricate documents and add them to the evidence repository available to the fact-checking system, and AdversarialModification, where existing…
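The AdversarialAddition setting described above can be illustrated with a toy sketch. This is not the authors' implementation: the token-overlap retriever, the function names (`overlap_score`, `retrieve`, `adversarial_addition`), and the hand-written example documents are all assumptions made for illustration, and in the paper's setting the fabricated documents would come from a text generator rather than being written by hand. The sketch only shows the basic mechanism: documents added to the evidence repository can displace genuine evidence in what the retriever hands to the verifier.

```python
import re


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def overlap_score(claim, doc):
    """Toy relevance score: number of claim tokens that also occur in the document."""
    doc_tokens = set(tokenize(doc))
    return sum(1 for tok in tokenize(claim) if tok in doc_tokens)


def retrieve(claim, repository, k=2):
    """Return the top-k documents by overlap score (ties keep repository order)."""
    ranked = sorted(repository, key=lambda d: overlap_score(claim, d), reverse=True)
    return ranked[:k]


def adversarial_addition(repository, fabricated_docs):
    """Hypothetical attack step: append fabricated documents to the repository."""
    return list(repository) + list(fabricated_docs)


# Hand-written stand-ins; all of these strings are illustrative assumptions.
CLAIM = "The Eiffel Tower is located in Paris."
CLEAN_REPOSITORY = [
    "The Eiffel Tower is a wrought-iron tower in Paris.",
    "The Louvre is a museum in Paris.",
    "Berlin is the capital of Germany.",
]
FABRICATED_DOCS = [
    "The Eiffel Tower is located in Rome, not in Paris.",
    "The Eiffel Tower is located in Rome after leaving Paris.",
]

if __name__ == "__main__":
    attacked = adversarial_addition(CLEAN_REPOSITORY, FABRICATED_DOCS)
    print("Clean retrieval:   ", retrieve(CLAIM, CLEAN_REPOSITORY))
    print("Attacked retrieval:", retrieve(CLAIM, attacked))
```

In this toy case the fabricated documents reuse every token of the claim, so they outrank the genuine evidence and fill both retrieval slots. A real system would use a learned retriever and verifier, so the size of the effect there is an empirical question that the paper measures.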

Code & Models

Repositories

yibing-du/adversarial-factcheck (PyTorch, official)

Videos

Synthetic Disinformation Attacks on Automated Fact Verification Systems · Underline

Taxonomy

Topics: Misinformation and Its Impacts · Adversarial Robustness in Machine Learning · Advanced Malware Detection Techniques