Solving Neural Min-Max Games: The Role of Architecture, Initialization & Dynamics

Deep Patel; Emmanouil-Vasileios Vlatakis-Gkaragkounis

arXiv:2512.00389·cs.LG·December 2, 2025

Solving Neural Min-Max Games: The Role of Architecture, Initialization & Dynamics

Deep Patel, Emmanouil-Vasileios Vlatakis-Gkaragkounis

PDF

Open Access 1 Video

TL;DR

This paper provides a theoretical framework explaining why simple gradient methods often converge in neural min-max games, highlighting the roles of architecture, initialization, and overparameterization in guaranteeing global convergence.

Contribution

It introduces the first theoretical results for convergence in two-layer neural network min-max games, linking hidden convexity and overparameterization to convergence guarantees.

Findings

01

Global convergence to Nash equilibrium under certain conditions

02

Path-length bounds for gradient descent-ascent in min-max games

03

High-probability reduction to convex-concave geometry via overparameterization

Abstract

Many emerging applications - such as adversarial training, AI alignment, and robust optimization - can be framed as zero-sum games between neural nets, with von Neumann-Nash equilibria (NE) capturing the desirable system behavior. While such games often involve non-convex non-concave objectives, empirical evidence shows that simple gradient methods frequently converge, suggesting a hidden geometric structure. In this paper, we provide a theoretical framework that explains this phenomenon through the lens of hidden convexity and overparameterization. We identify sufficient conditions - spanning initialization, training dynamics, and network width - that guarantee global convergence to a NE in a broad class of non-convex min-max games. To our knowledge, this is the first such result for games that involve two-layer neural networks. Technically, our approach is twofold: (a) we derive a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Solving Neural Min-Max Games: The Role of Architecture, Initialization & Dynamics· slideslive

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Adversarial Robustness in Machine Learning · Advanced Graph Neural Networks