On the global convergence of randomized coordinate gradient descent for   non-convex optimization

Ziang Chen; Yingzhou Li; Jianfeng Lu

arXiv:2101.01323·math.OC·December 1, 2022·1 cites

On the global convergence of randomized coordinate gradient descent for non-convex optimization

Ziang Chen, Yingzhou Li, Jianfeng Lu

PDF

Open Access

TL;DR

This paper proves that randomized coordinate gradient descent almost surely avoids strict saddle points and converges to local minima in non-convex optimization, under broad assumptions.

Contribution

It provides the first rigorous analysis showing global convergence to local minima for coordinate descent in non-convex problems with random coordinate selection.

Findings

01

Algorithm almost surely escapes strict saddle points

02

Converges to local minima under generic conditions

03

Analysis based on nonlinear random dynamical systems

Abstract

In this work, we analyze the global convergence property of coordinate gradient descent with random choice of coordinates and stepsizes for non-convex optimization problems. Under generic assumptions, we prove that the algorithm iterate will almost surely escape strict saddle points of the objective function. As a result, the algorithm is guaranteed to converge to local minima if all saddle points are strict. Our proof is based on viewing coordinate descent algorithm as a nonlinear random dynamical system and a quantitative finite block analysis of its linearization around saddle points.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Topological and Geometric Data Analysis