The Landscape of the Proximal Point Method for Nonconvex-Nonconcave   Minimax Optimization

Benjamin Grimmer; Haihao Lu; Pratik Worah; Vahab Mirrokni

arXiv:2006.08667·math.OC·April 2, 2021

The Landscape of the Proximal Point Method for Nonconvex-Nonconcave Minimax Optimization

Benjamin Grimmer, Haihao Lu, Pratik Worah, Vahab Mirrokni

PDF

Open Access

TL;DR

This paper analyzes the proximal point method for nonconvex-nonconcave minimax problems, revealing conditions for convergence and divergence based on the interaction strength between variables, with implications for machine learning applications.

Contribution

It introduces a novel analysis of the proximal point method using a generalized Moreau envelope, identifying three problem regions and their convergence behaviors.

Findings

01

Strong interaction leads to global linear convergence.

02

Weak interaction allows local convergence with proper initialization.

03

Intermediate interaction may cause divergence or limit cycles.

Abstract

Minimax optimization has become a central tool in machine learning with applications in robust optimization, reinforcement learning, GANs, etc. These applications are often nonconvex-nonconcave, but the existing theory is unable to identify and deal with the fundamental difficulties this poses. In this paper, we study the classic proximal point method (PPM) applied to nonconvex-nonconcave minimax problems. We find that a classic generalization of the Moreau envelope by Attouch and Wets provides key insights. Critically, we show this envelope not only smooths the objective but can convexify and concavify it based on the level of interaction present between the minimizing and maximizing variables. From this, we identify three distinct regions of nonconvex-nonconcave problems. When interaction is sufficiently strong, we derive global linear convergence guarantees. Conversely when the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSparse and Compressive Sensing Techniques · Advanced Optimization Algorithms Research · Stochastic Gradient Optimization Techniques