Teaching an Old Dynamics New Tricks: Regularization-free Last-iterate Convergence in Zero-sum Games via BNN Dynamics

Tuo Zhang; Leonardo Stella

arXiv:2602.08938·cs.MA·February 10, 2026

Teaching an Old Dynamics New Tricks: Regularization-free Last-iterate Convergence in Zero-sum Games via BNN Dynamics

Tuo Zhang, Leonardo Stella

PDF

Open Access

TL;DR

This paper introduces a regularization-free method using Brown-von Neumann-Nash dynamics for last-iterate convergence in zero-sum games, applicable to both normal-form and extensive-form games with neural approximation, outperforming existing approaches.

Contribution

It repurposes classical BNN dynamics for zero-sum games, providing convergence guarantees without regularization and developing a scalable neural implementation for complex game settings.

Findings

01

Achieves last-iterate convergence in noisy normal-form games.

02

Outperforms regularization-based methods in empirical tests.

03

Adapts quickly to nonstationarities in game environments.

Abstract

Zero-sum games are a fundamental setting for adversarial training and decision-making in multi-agent learning (MAL). Existing methods often ensure convergence to (approximate) Nash equilibria by introducing a form of regularization. Yet, regularization requires additional hyperparameters, which must be carefully tuned--a challenging task when the payoff structure is known, and considerably harder when the structure is unknown or subject to change. Motivated by this problem, we repurpose a classical model in evolutionary game theory, i.e., the Brown-von Neumann-Nash (BNN) dynamics, by leveraging the intrinsic convergence of this dynamics in zero-sum games without regularization, and provide last-iterate convergence guarantees in noisy normal-form games (NFGs). Importantly, to make this approach more applicable, we develop a novel framework with theoretical guarantees that integrates the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Adversarial Robustness in Machine Learning · Reinforcement Learning in Robotics