Corrupted Learning Dynamics in Games

Taira Tsuchiya; Shinji Ito; Haipeng Luo

arXiv:2412.07120·cs.GT·February 18, 2025

Corrupted Learning Dynamics in Games

Taira Tsuchiya, Shinji Ito, Haipeng Luo

PDF

Open Access

TL;DR

This paper introduces adaptive learning dynamics for games that remain effective even when players deviate from prescribed strategies, ensuring convergence to equilibrium despite corruption or strategic deviations.

Contribution

The paper develops a novel framework for corrupted learning dynamics that adaptively bounds regret and swap regret based on players' deviations, extending existing algorithms to corrupted environments.

Findings

01

Bounded regret in corrupted two-player zero-sum games.

02

Bounded swap regret in multi-player general-sum corrupted games.

03

Framework extends to corruption in utilities and matches existing bounds in honest settings.

Abstract

Learning in games refers to scenarios where multiple players interact in a shared environment, each aiming to minimize their regret. An equilibrium can be computed at a fast rate of $O (1/ T)$ when all players follow the optimistic follow-the-regularized-leader (OFTRL). However, this acceleration is limited to the honest regime, in which all players adhere to a prescribed algorithm -- a situation that may not be realistic in practice. To address this issue, we present corrupted learning dynamics that adaptively find an equilibrium at a rate that depends on the extent to which each player deviates from the strategy suggested by the prescribed algorithm. First, in two-player zero-sum corrupted games, we provide learning dynamics for which the external regret of $x$ -player (and similarly for $y$ -player) is roughly bounded by $O (lo g (m_{x} m_{y}) + \hat{C}_{y} + \hat{C}_{x})$ , where $m_{x}$ and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Reinforcement Learning in Robotics · Opinion Dynamics and Social Influence