Online Convex Optimization for Sequential Decision Processes and Extensive-Form Games
Gabriele Farina, Christian Kroer, Tuomas Sandholm

TL;DR
This paper introduces a generalized regret minimization framework for sequential decision processes and extensive-form games with convex sets and losses, extending existing algorithms and enabling scalable solutions for large games.
Contribution
It develops laminar regret decomposition, generalizes CFR to convex settings, and introduces new algorithms for regularized equilibria and opponent exploitation.
Findings
Framework scales comparably to fastest CFR variants
First algorithm for quantal response equilibria in large games
Enables scalable opponent exploitation methods
Abstract
Regret minimization is a powerful tool for solving large-scale extensive-form games. State-of-the-art methods rely on minimizing regret locally at each decision point. In this work we derive a new framework for regret minimization on sequential decision problems and extensive-form games with general compact convex sets at each decision point and general convex losses, as opposed to prior work which has been for simplex decision points and linear losses. We call our framework laminar regret decomposition. It generalizes the CFR algorithm to this more general setting. Furthermore, our framework enables a new proof of CFR even in the known setting, which is derived from a perspective of decomposing polytope regret, thereby leading to an arguably simpler interpretation of the algorithm. Our generalization to convex compact sets and convex losses allows us to develop new algorithms for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
