AREG: Adversarial Resource Extraction Game for Evaluating Persuasion and Resistance in Large Language Models

Adib Sakhawat; Fardeen Sadab

arXiv:2602.16639·cs.CL·February 19, 2026

Open Access

TL;DR

The paper introduces AREG, a benchmark that evaluates both persuasion and resistance in large language models through adversarial multi-turn negotiation. It finds that the two capabilities are only weakly correlated ($\rho = 0.33$) and that resistance scores consistently exceed persuasion scores.

Contribution

It presents a multi-turn, zero-sum negotiation framework for evaluating social influence in LLMs and shows that persuasion and resistance are dissociated capabilities.

Findings

01. Resistance scores are higher than persuasion scores across models.

02. Weak correlation ($\rho = 0.33$) between persuasion and resistance.

03. Interaction structure significantly influences social influence outcomes.

Abstract

Evaluating the social intelligence of Large Language Models (LLMs) increasingly requires moving beyond static text generation toward dynamic, adversarial interaction. We introduce the Adversarial Resource Extraction Game (AREG), a benchmark that operationalizes persuasion and resistance as a multi-turn, zero-sum negotiation over financial resources. Using a round-robin tournament across frontier models, AREG enables joint evaluation of offensive (persuasion) and defensive (resistance) capabilities within a single interactional framework. Our analysis provides evidence that these capabilities are weakly correlated ($\rho = 0.33$) and empirically dissociated: strong persuasive performance does not reliably predict strong resistance, and vice versa. Across all evaluated models, resistance scores exceed persuasion scores, indicating a systematic defensive advantage in adversarial dialogue…
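To make the round-robin idea concrete, the following is a minimal sketch of how per-model persuasion and resistance scores might be aggregated and then correlated. It is not the paper's scoring code. The function names, the `extracted` table, the three model labels, and every number in it are hypothetical, and the sketch assumes that $\rho$ is a rank correlation such as Spearman's, which the abstract does not state.

```python
from statistics import mean

def round_robin_scores(models, extracted):
    """Aggregate a round-robin into per-model scores (hypothetical scheme).

    extracted[(p, d)] is the fraction of defender d's resources that
    persuader p obtained in their game, a value in [0, 1].
    """
    # Persuasion: average fraction extracted when acting as persuader.
    persuasion = {m: mean(extracted[(m, d)] for d in models if d != m)
                  for m in models}
    # Resistance: average fraction retained when acting as defender.
    resistance = {m: mean(1.0 - extracted[(p, m)] for p in models if p != m)
                  for m in models}
    return persuasion, resistance

def ranks(values):
    """1-based ranks; tied values share the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = shared
        i = j + 1
    return result

def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / (var_x * var_y) ** 0.5

def spearman(x, y):
    """Spearman rank correlation: Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))

# Hypothetical toy tournament: every ordered (persuader, defender) pair plays once.
models = ["A", "B", "C"]
extracted = {
    ("A", "B"): 0.40, ("A", "C"): 0.20,
    ("B", "A"): 0.10, ("B", "C"): 0.35,
    ("C", "A"): 0.20, ("C", "B"): 0.10,
}
persuasion, resistance = round_robin_scores(models, extracted)
rho = spearman([persuasion[m] for m in models],
               [resistance[m] for m in models])
```

With only three made-up models the resulting `rho` carries no meaning. The point is the shape of the computation: each model plays both roles against every other model, and the two role-specific averages are then compared across models.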

Taxonomy

Topics: Explainable Artificial Intelligence (XAI) · Hate Speech and Cyberbullying Detection · Topic Modeling