Scaling Inference-Time Computation via Opponent Simulation: Enabling Online Strategic Adaptation in Repeated Negotiation

Xiangyu Liu; Di Wang; Zhe Feng; Aranyak Mehta

arXiv:2602.19309·cs.MA·February 27, 2026

Scaling Inference-Time Computation via Opponent Simulation: Enabling Online Strategic Adaptation in Repeated Negotiation

Xiangyu Liu, Di Wang, Zhe Feng, Aranyak Mehta

PDF

Open Access

TL;DR

This paper introduces a novel inference-time scaling method for large language models that enables online strategic adaptation in repeated negotiations by simulating opponents, leading to improved performance without additional training.

Contribution

The paper proposes a scalable inference-time approach using opponent simulation and smooth Fictitious Play principles for online strategic adaptation in LLMs.

Findings

01

Significant performance improvements in repeated negotiation games.

02

Effective online adaptation without parameter updates.

03

Scalable inference-time computation method.

Abstract

While large language models (LLMs) have emerged as powerful decision-makers across a wide range of single-agent and stationary environments, fewer efforts have been devoted to settings where LLMs must engage in \emph{repeated} and \emph{strategic} interactions with unknown or dynamic opponents. In such settings, recipes built upon \emph{offline} pre-training or fine-tuning, though robust against worst-case adversaries, do not fully exploit the capability of LLMs to adapt \emph{online} based on interaction feedback. Instead, we explore the more natural perspective of scaling inference-time computation as a mechanism for adaptation, embedding the principles of a classical game-theoretical learning dynamic, \emph{smooth Fictitious Play (sFP)}, into LLM inference: (i) for belief formation, we employ an auxiliary opponent model that in-context learns to imitate the time-averaged behavior of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Explainable Artificial Intelligence (XAI) · Game Theory and Applications