Teaching an Old Dynamics New Tricks: Regularization-free Last-iterate Convergence in Zero-sum Games via BNN Dynamics
Tuo Zhang, Leonardo Stella

TL;DR
This paper introduces a regularization-free method using Brown-von Neumann-Nash dynamics for last-iterate convergence in zero-sum games, applicable to both normal-form and extensive-form games with neural approximation, outperforming existing approaches.
Contribution
It repurposes classical BNN dynamics for zero-sum games, providing convergence guarantees without regularization and developing a scalable neural implementation for complex game settings.
Findings
Achieves last-iterate convergence in noisy normal-form games.
Outperforms regularization-based methods in empirical tests.
Adapts quickly to nonstationarities in game environments.
Abstract
Zero-sum games are a fundamental setting for adversarial training and decision-making in multi-agent learning (MAL). Existing methods often ensure convergence to (approximate) Nash equilibria by introducing a form of regularization. Yet, regularization requires additional hyperparameters, which must be carefully tuned--a challenging task when the payoff structure is known, and considerably harder when the structure is unknown or subject to change. Motivated by this problem, we repurpose a classical model in evolutionary game theory, i.e., the Brown-von Neumann-Nash (BNN) dynamics, by leveraging the intrinsic convergence of this dynamics in zero-sum games without regularization, and provide last-iterate convergence guarantees in noisy normal-form games (NFGs). Importantly, to make this approach more applicable, we develop a novel framework with theoretical guarantees that integrates the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Adversarial Robustness in Machine Learning · Reinforcement Learning in Robotics
