Learning and Computation of $\Phi$-Equilibria at the Frontier of Tractability
Brian Hu Zhang, Ioannis Anagnostides, Emanuel Tewolde, Ratip Emin Berker, Gabriele Farina, Vincent Conitzer, Tuomas Sandholm

TL;DR
This paper extends the computational framework for $\
Contribution
It introduces algorithms for computing $\
Findings
Polynomial-time algorithm for $\
Nearly matching lower bounds established
Extension of $\
Abstract
-equilibria -- and the associated notion of -regret -- are a powerful and flexible framework at the heart of online learning and game theory, whereby enriching the set of deviations begets stronger notions of rationality. Recently, Daskalakis, Farina, Fishelson, Pipis, and Schneider (STOC '24) -- abbreviated as DFFPS -- settled the existence of efficient algorithms when contains only linear maps under a general, -dimensional convex constraint set . In this paper, we significantly extend their work by resolving the case where is -dimensional; degree- polynomials constitute a canonical such example with . In particular, positing only oracle access to , we obtain two main positive results: i) a -time algorithm for computing -approximate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Game Theory and Applications · Reinforcement Learning in Robotics
MethodsSparse Evolutionary Training
