AIDG: Evaluating Asymmetry Between Information Extraction and Containment in Multi-Turn Dialogue

Adib Sakhawat; Fardeen Sadab; Rakin Shahriar

arXiv:2602.17443·cs.CL·February 20, 2026

AIDG: Evaluating Asymmetry Between Information Extraction and Containment in Multi-Turn Dialogue

Adib Sakhawat, Fardeen Sadab, Rakin Shahriar

PDF

Open Access

TL;DR

This paper introduces AIDG, a game-theoretic framework to evaluate LLMs' abilities in dynamic dialogue, revealing a significant asymmetry favoring containment over active deduction in multi-turn interactions.

Contribution

The paper presents AIDG, a novel framework with two tasks to measure information deduction and containment, highlighting the asymmetry in LLMs' strategic reasoning capabilities.

Findings

01

Models perform better at containment than deduction.

02

Confirmation strategies are significantly more effective than blind deduction.

03

Instruction-following degrades under conversational load, affecting deduction.

Abstract

Evaluating the strategic reasoning capabilities of Large Language Models (LLMs) requires moving beyond static benchmarks to dynamic, multi-turn interactions. We introduce AIDG (Adversarial Information Deduction Game), a game-theoretic framework that probes the asymmetry between information extraction (active deduction) and information containment (state maintenance) in dialogue. We propose two complementary tasks: AIDG-I, measuring pragmatic strategy in social deduction, and AIDG-II, measuring constraint satisfaction in a structured "20 Questions" setting. Across 439 games with six frontier LLMs, we observe a clear capability asymmetry: models perform substantially better at containment than deduction, with a 350 ELO advantage on defense;(Cohen's d = 5.47). We identify two bottlenecks driving this gap: (1) Information Dynamics, where confirmation strategies are 7.75x more effective than…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Speech and dialogue systems · Advanced Graph Neural Networks