LLM-Based Adversarial Persuasion Attacks on Fact-Checking Systems
Jo\~ao A. Leite, Olesya Razuvayevskaya, Kalina Bontcheva, Carolina Scarton

TL;DR
This paper introduces a novel adversarial attack method on automated fact-checking systems that uses persuasion techniques generated by large language models to evade detection and manipulate verification results.
Contribution
It presents a new class of adversarial attacks leveraging persuasion techniques via generative LLMs, revealing vulnerabilities in current AFC systems.
Findings
Persuasion attacks significantly reduce verification accuracy.
Evidence retrieval performance is also degraded by persuasion techniques.
Persuasion techniques are identified as a potent adversarial tool.
Abstract
Automated fact-checking (AFC) systems are susceptible to adversarial attacks, enabling false claims to evade detection. Existing adversarial frameworks typically rely on injecting noise or altering semantics, yet no existing framework exploits the adversarial potential of persuasion techniques, which are widely used in disinformation campaigns to manipulate audiences. In this paper, we introduce a novel class of persuasive adversarial attacks on AFCs by employing a generative LLM to rephrase claims using persuasion techniques. Considering 15 techniques grouped into 6 categories, we study the effects of persuasion on both claim verification and evidence retrieval using a decoupled evaluation strategy. Experiments on the FEVER and FEVEROUS benchmarks show that persuasion attacks can substantially degrade both verification performance and evidence retrieval. Our analysis identifies…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMisinformation and Its Impacts · Adversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI)
