On the Exploitability of FTRL Dynamics
Yiheng Su, Emmanouil-Vasileios Vlatakis-Gkaragkounis

TL;DR
This paper analyzes the inherent exploitability of FTRL algorithms in two-player zero-sum games, revealing how regularizer steepness influences vulnerability to strategic exploitation.
Contribution
It demonstrates that exploitability is intrinsic to FTRL, quantifies how it scales with suboptimal actions and game parameters, and introduces a measure of vulnerability to manipulation.
Findings
Exploitability scales with the number of suboptimal actions as Ω(N/η).
Surplus can be guaranteed regardless of equilibrium structure in random games.
Steep regularizers delay exploitation, while non-steep ones enable maximum surplus extraction.
Abstract
In this paper we investigate the exploitability of a Follow-the-Regularized-Leader (FTRL) learner with constant step size in two-player zero-sum games played over rounds against a clairvoyant optimizer. In contrast with prior analysis, we show that exploitability is an inherent feature of the FTRL family, rather than an artifact of specific instantiations. First, for fixed optimizer, we establish a sweeping law of order , proving that exploitation scales to the number of the learner's suboptimal actions and vanishes in their absence. Second, for alternating optimizer, a surplus of can be guaranteed regardless of the equilibrium structure, with high probability, in random games. Our analysis uncovers once more the sharp geometric dichotomy: non-steep regularizers allow the optimizer to extract maximum surplus…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
