Recent Theoretical Advances in Non-Convex Optimization

Marina Danilova; Pavel Dvurechensky; Alexander Gasnikov; Eduard; Gorbunov; Sergey Guminov; Dmitry Kamzolov; Innokentiy Shibaev

arXiv:2012.06188·math.OC·November 29, 2021·5 cites

Recent Theoretical Advances in Non-Convex Optimization

Marina Danilova, Pavel Dvurechensky, Alexander Gasnikov, Eduard, Gorbunov, Sergey Guminov, Dmitry Kamzolov, Innokentiy Shibaev

PDF

Open Access

TL;DR

This paper reviews recent theoretical advances in non-convex optimization, focusing on convergence guarantees of various algorithms and problem classes relevant to deep learning and data analysis.

Contribution

It provides a comprehensive overview of recent theoretical results, including convergence rates and problem structures that enable efficient solutions in non-convex optimization.

Findings

01

Efficient algorithms exist for structured non-convex problems.

02

Convergence rates are established for first-order and stochastic methods.

03

Certain classes like Polyak--Lojasiewicz functions allow guaranteed convergence.

Abstract

Motivated by recent increased interest in optimization algorithms for non-convex optimization in application to training deep neural networks and other optimization problems in data analysis, we give an overview of recent theoretical results on global performance guarantees of optimization algorithms for non-convex optimization. We start with classical arguments showing that general non-convex problems could not be solved efficiently in a reasonable time. Then we give a list of problems that can be solved efficiently to find the global minimizer by exploiting the structure of the problem as much as it is possible. Another way to deal with non-convexity is to relax the goal from finding the global minimum to finding a stationary point or a local minimum. For this setting, we first present known results for the convergence rates of deterministic first-order methods, which are then…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Advanced Optimization Algorithms Research