MEMO: Memory-Augmented Model Context Optimization for Robust Multi-Turn Multi-Agent LLM Games

Yunfei Xie; Kevin Wang; Bobby Cheng; Jianzhu Yao; Zhizhou Sha; Alexander Duffy; Yihan Xi; Hongyuan Mei; Cheston Tan; Chen Wei; Pramod Viswanath; Zhangyang Wang

arXiv:2603.09022·cs.AI·March 20, 2026

MEMO: Memory-Augmented Model Context Optimization for Robust Multi-Turn Multi-Agent LLM Games

Yunfei Xie, Kevin Wang, Bobby Cheng, Jianzhu Yao, Zhizhou Sha, Alexander Duffy, Yihan Xi, Hongyuan Mei, Cheston Tan, Chen Wei, Pramod Viswanath, Zhangyang Wang

PDF

Open Access

TL;DR

MEMO is a self-play framework that optimizes model context with memory and exploration techniques, significantly improving stability and performance in multi-agent LLM games across various tasks.

Contribution

It introduces a novel memory-augmented context optimization method that reduces variance and enhances robustness in multi-turn, multi-agent LLM interactions.

Findings

01

Mean win rate nearly doubled across five games.

02

Run-to-run variance decreased, stabilizing rankings.

03

Largest gains observed in negotiation and imperfect-information games.

Abstract

Multi-turn, multi-agent LLM game evaluations often exhibit substantial run-to-run variance. In long-horizon interactions, small early deviations compound across turns and are amplified by multi-agent coupling. This biases win rate estimates and makes rankings unreliable across repeated tournaments. Prompt choice worsens this further by producing different effective policies. We address both instability and underperformance with MEMO (Memory-augmented MOdel context optimization), a self-play framework that optimizes inference-time context by coupling retention and exploration. Retention maintains a persistent memory bank that stores structured insights from self-play trajectories and injects them as priors during later play. Exploration runs tournament-style prompt evolution with uncertainty-aware selection via TrueSkill, and uses prioritized replay to revisit rare and decisive states.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Games · Reinforcement Learning in Robotics · Educational Games and Gamification