Corrupted Learning Dynamics in Games
Taira Tsuchiya, Shinji Ito, Haipeng Luo

TL;DR
This paper introduces adaptive learning dynamics for games that remain effective even when players deviate from prescribed strategies, ensuring convergence to equilibrium despite corruption or strategic deviations.
Contribution
The paper develops a novel framework for corrupted learning dynamics that adaptively bounds regret and swap regret based on players' deviations, extending existing algorithms to corrupted environments.
Findings
Bounded regret in corrupted two-player zero-sum games.
Bounded swap regret in multi-player general-sum corrupted games.
Framework extends to corruption in utilities and matches existing bounds in honest settings.
Abstract
Learning in games refers to scenarios where multiple players interact in a shared environment, each aiming to minimize their regret. An equilibrium can be computed at a fast rate of when all players follow the optimistic follow-the-regularized-leader (OFTRL). However, this acceleration is limited to the honest regime, in which all players adhere to a prescribed algorithm -- a situation that may not be realistic in practice. To address this issue, we present corrupted learning dynamics that adaptively find an equilibrium at a rate that depends on the extent to which each player deviates from the strategy suggested by the prescribed algorithm. First, in two-player zero-sum corrupted games, we provide learning dynamics for which the external regret of -player (and similarly for -player) is roughly bounded by , where and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Reinforcement Learning in Robotics · Opinion Dynamics and Social Influence
