Direct-Search for a Class of Stochastic Min-Max Problems

Sotiris Anagnostidis; Aurelien Lucchi; Youssef Diouane

arXiv:2102.11386·math.OC·April 15, 2021

Direct-Search for a Class of Stochastic Min-Max Problems

Sotiris Anagnostidis, Aurelien Lucchi, Youssef Diouane

PDF

Open Access

TL;DR

This paper introduces a novel derivative-free direct-search algorithm for stochastic min-max problems, proving its convergence under mild conditions, especially when the max-player satisfies the PL condition and the min-player is nonconvex.

Contribution

The work presents the first convergence analysis of a direct-search method for stochastic min-max problems with nonconvex and PL conditions.

Findings

01

Proves convergence of the proposed algorithm under mild assumptions.

02

Handles stochastic settings with dynamically estimated oracle accuracy.

03

Addresses nonconvex min-player and PL max-player scenarios.

Abstract

Recent applications in machine learning have renewed the interest of the community in min-max optimization problems. While gradient-based optimization methods are widely used to solve such problems, there are however many scenarios where these techniques are not well-suited, or even not applicable when the gradient is not accessible. We investigate the use of direct-search methods that belong to a class of derivative-free techniques that only access the objective function through an oracle. In this work, we design a novel algorithm in the context of min-max saddle point games where one sequentially updates the min and the max player. We prove convergence of this algorithm under mild assumptions, where the objective of the max-player satisfies the Polyak-\L{}ojasiewicz (PL) condition, while the min-player is characterized by a nonconvex objective. Our method only assumes dynamically…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Advanced Bandit Algorithms Research · Reinforcement Learning in Robotics