LLM-Based Adversarial Persuasion Attacks on Fact-Checking Systems

Jo\~ao A. Leite; Olesya Razuvayevskaya; Kalina Bontcheva; Carolina Scarton

arXiv:2601.16890·cs.CL·January 26, 2026

LLM-Based Adversarial Persuasion Attacks on Fact-Checking Systems

Jo\~ao A. Leite, Olesya Razuvayevskaya, Kalina Bontcheva, Carolina Scarton

PDF

Open Access

TL;DR

This paper introduces a novel adversarial attack method on automated fact-checking systems that uses persuasion techniques generated by large language models to evade detection and manipulate verification results.

Contribution

It presents a new class of adversarial attacks leveraging persuasion techniques via generative LLMs, revealing vulnerabilities in current AFC systems.

Findings

01

Persuasion attacks significantly reduce verification accuracy.

02

Evidence retrieval performance is also degraded by persuasion techniques.

03

Persuasion techniques are identified as a potent adversarial tool.

Abstract

Automated fact-checking (AFC) systems are susceptible to adversarial attacks, enabling false claims to evade detection. Existing adversarial frameworks typically rely on injecting noise or altering semantics, yet no existing framework exploits the adversarial potential of persuasion techniques, which are widely used in disinformation campaigns to manipulate audiences. In this paper, we introduce a novel class of persuasive adversarial attacks on AFCs by employing a generative LLM to rephrase claims using persuasion techniques. Considering 15 techniques grouped into 6 categories, we study the effects of persuasion on both claim verification and evidence retrieval using a decoupled evaluation strategy. Experiments on the FEVER and FEVEROUS benchmarks show that persuasion attacks can substantially degrade both verification performance and evidence retrieval. Our analysis identifies…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMisinformation and Its Impacts · Adversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI)