Solving Imperfect-Recall Games via Sum-of-Squares Optimization
Rui Zheng, Ryann Sim, Antonios Varvitsiotis

TL;DR
This paper introduces sum-of-squares hierarchies for computing strategies in imperfect-recall extensive-form games, providing theoretical convergence guarantees and efficient algorithms for certain game classes.
Contribution
It develops SOS-based methods for equilibrium computation in IREFGs, including convergence analysis and new game classes with simplified solutions.
Findings
SOS hierarchies converge asymptotically
Finite convergence under genericity assumptions
Single-level SOS hierarchy for certain game classes
Abstract
Extensive-form games (EFGs) provide a powerful framework for modeling sequential decision making, capturing strategic interaction under imperfect information, chance events, and temporal structure. Most positive algorithmic and theoretical results for EFGs assume perfect recall, where players remember all past information and actions. We study the increasingly relevant setting of imperfect-recall EFGs (IREFGs), where players may forget parts of their history or previously acquired information, and where equilibrium computation is provably hard. We propose sum-of-squares (SOS) hierarchies for computing ex-ante optimal strategies in single-player IREFGs and Nash equilibria in multi-player IREFGs, working over behavioral strategies. Our theoretical results show that (i) these hierarchies converge asymptotically, (ii) under genericity assumptions, the convergence is finite, and (iii) in…
Peer Reviews
Decision·Submitted to ICLR 2026
1) The tackles an important problem that has been receiving increasingly popular attention. The authors do an excellent job covering some of the recent (and more classic) related literature on imperfect recall games, and motivate their setting by discussing their applications in solving large games via abstractions, modeling team games, and for safety & security. The existing negative results are clearly presented and therefore it is very clear where the contributions of this paper fits in the l
1) At times the paper becomes incredibly dense with overwhelmingly many definitions and notation. I found sections 2.2 and 3 particularly difficult to follow for someone without significant experience in sum of squares / polynomial optimization. While this is somewhat inevitable due to the many concepts required for defining both imperfect-recall extensive-form games and SOS hierarchies and the limited space, it does end up decreasing the accessibility of the paper. I would suggest the authors t
The paper is well written and clear. I find the idea of using SoS as a relaxation for imperfect recall games very interesting; indeed, it is an idea that I myself have toyed around with a bit, albeit without much progress.
The paper should also cite and compare to [1, 2], which covers timeable two-player zero-sum imperfect-recall games (team games). The techniques used in these papers, although not SoS, are basically "lift-and-project"-style algorithms, and have the same flavor of complexity that depends on a "degree-like" parameter, which the papers characterize. My main criticism is that the paper feels a bit preliminary. The results follow mostly from basically restating known results in SoS land once one has
* Improves one of the hardness-results for single-player imperfect recall games. * Deepens the connection between Moment-SOS and the polynomial games, which was proposed in [1]. * Provides practical usage of Moment-SOS for imperfect recall games and the behavior of Moment-SOS for some subclasses of those game.
* Most of the results seem to directly follow from the definition of imperfect recall games as a polynomial game, the construction provided in [1] and the behavior of Moment-SOS. * Without the prior knowledge the concepts introduced in Section 2.2 would be really difficult to follow, which authors probably realized and provided 2 works that delve more into those concepts. * The clashing notation of SOS and games makes the paper difficult to follow at first. Some notational details are not defin
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGame Theory and Applications · Advanced Bandit Algorithms Research · Reinforcement Learning in Robotics
