Online Optimization in Games via Control Theory: Connecting Regret, Passivity and Poincar\'e Recurrence
Yun Kuen Cheung, Georgios Piliouras

TL;DR
This paper introduces a control-theoretic framework for online optimization in games, showing that passivity and energy conservation in dynamics lead to bounded regret and recurrent behaviors, extending stability analysis to Poincaré recurrence.
Contribution
It establishes a novel connection between passivity in control theory and bounded regret in online game learning, introducing lossless dynamics and energy-preserving behaviors.
Findings
All continuous-time FTRL dynamics are lossless and passive.
Convex combinations of FTRL dynamics are also lossless with bounded regret.
Lossless game dynamics can exhibit Poincaré recurrence, indicating recurrent behaviors.
Abstract
We present a novel control-theoretic understanding of online optimization and learning in games, via the notion of passivity. Passivity is a fundamental concept in control theory, which abstracts energy conservation and dissipation in physical systems. It has become a standard tool in analysis of general feedback systems, to which game dynamics belong. Our starting point is to show that all continuous-time Follow-the-Regularized-Leader (FTRL) dynamics, which include the well-known Replicator Dynamic, are lossless, i.e. it is passive with no energy dissipation. Interestingly, we prove that passivity implies bounded regret, connecting two fundamental primitives of control theory and online optimization. The observation of energy conservation in FTRL inspires us to present a family of lossless learning dynamics, each of which has an underlying energy function with a simple gradient…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Reinforcement Learning in Robotics · Mathematical Biology Tumor Growth
