Do LLMs Strategically Reveal, Conceal, and Infer Information? A Theoretical and Empirical Analysis in The Chameleon Game
Mustafa O. Karabag, Jan Sobotka, Ufuk Topcu

TL;DR
This paper investigates whether large language models can strategically control information in a game setting, revealing or concealing secrets, through theoretical analysis and empirical experiments with multiple models.
Contribution
It introduces the Chameleon game to evaluate LLMs' information control capabilities and demonstrates how internal representation manipulation can induce desired behaviors.
Findings
LLMs can identify the chameleon but struggle to conceal secrets.
Instructions alone often fail to induce concealment behavior.
Linear manipulation of internal representations can reliably produce concealment.
Abstract
Large language model-based (LLM-based) agents have become common in settings that include non-cooperative parties. In such settings, agents' decision-making needs to conceal information from their adversaries, reveal information to their cooperators, and infer information to identify the other agents' characteristics. To investigate whether LLMs have these information control and decision-making capabilities, we make LLM agents play the language-based hidden-identity game, The Chameleon. In this game, a group of non-chameleon agents who do not know each other aim to identify the chameleon agent without revealing a secret. The game requires the aforementioned information control capabilities both as a chameleon and a non-chameleon. We begin with a theoretical analysis for a spectrum of strategies, from concealing to revealing, and provide bounds on the non-chameleons' winning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBusiness Law and Ethics
MethodsAttention Is All You Need · Linear Layer · Dense Connections · Residual Connection · Multi-Head Attention · Label Smoothing · Layer Normalization · Position-Wise Feed-Forward Layer · Adam · Softmax
