Convergence Rate of Frank-Wolfe for Non-Convex Objectives

Simon Lacoste-Julien

arXiv:1607.00345 · math.OC · July 7, 2016 · 104 cites

TL;DR

This paper proves that the Frank-Wolfe algorithm reaches a stationary point at a rate of O(1/√t) on non-convex objectives with Lipschitz continuous gradients, a rate similar to those already known for projected gradient methods (though measured with slightly different notions of stationarity).

Contribution

It provides a simple, affine-invariant proof of the convergence rate of Frank-Wolfe on non-convex objectives. To the best of the author's knowledge, it is the first analysis to give a rate of this kind for Frank-Wolfe.

Findings

01

Frank-Wolfe achieves O(1/√t) convergence rate on non-convex problems.

02

The proof is affine invariant and straightforward.

03

To the author's knowledge, the first Frank-Wolfe rate in this setting comparable to those proven for projected gradient methods.

Abstract

We give a simple proof that the Frank-Wolfe algorithm obtains a stationary point at a rate of $O(1/\sqrt{t})$ on non-convex objectives with a Lipschitz continuous gradient. Our analysis is affine invariant and is the first, to the best of our knowledge, giving a similar rate to what was already proven for projected gradient methods (though on slightly different measures of stationarity).
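
To make the setting concrete, here is a minimal sketch of Frank-Wolfe on a small non-convex problem. It is not code from the paper. The objective (an indefinite diagonal quadratic), the feasible set (the probability simplex), the names `frank_wolfe_simplex`, `D`, `f`, `grad_f`, and the step size rule min(gap / C, 1) with C = 2L are all assumptions chosen for illustration. Stationarity is tracked with the Frank-Wolfe gap, i.e. the maximum over feasible s of ⟨x − s, ∇f(x)⟩, which is zero exactly at stationary points of the constrained problem.

```python
import math


def frank_wolfe_simplex(grad, x0, curvature, num_iters):
    """Frank-Wolfe over the probability simplex (illustrative sketch).

    grad: function returning the gradient of f at x as a list.
    curvature: an upper bound C on L * diam^2 (assumed supplied by the caller).
    Returns the final iterate and the Frank-Wolfe gap at every iteration.
    """
    x = list(x0)
    gaps = []
    for _ in range(num_iters):
        g = grad(x)
        # Linear minimization oracle: on the simplex, the minimizer of
        # <s, g> is the vertex e_i with the smallest gradient coordinate.
        i = min(range(len(x)), key=lambda k: g[k])
        # Frank-Wolfe gap <x - e_i, g>, the stationarity measure.
        gap = sum(xk * gk for xk, gk in zip(x, g)) - g[i]
        gaps.append(gap)
        # Step size minimizing the quadratic upper bound, clipped to [0, 1].
        gamma = min(gap / curvature, 1.0)
        # Convex combination x <- (1 - gamma) * x + gamma * e_i.
        x = [(1.0 - gamma) * xk for xk in x]
        x[i] += gamma
    return x, gaps


# Hypothetical test problem: f(x) = 0.5 * sum_i D[i] * x[i]^2 with mixed-sign
# D, so f is non-convex; its gradient is Lipschitz with constant L = max|D[i]|.
D = [1.0, -0.5, 0.3, -1.0]
L = max(abs(d) for d in D)


def f(x):
    return 0.5 * sum(d * xk * xk for d, xk in zip(D, x))


def grad_f(x):
    return [d * xk for d, xk in zip(D, x)]


x0 = [0.25, 0.25, 0.25, 0.25]
num_iters = 200
# The squared Euclidean diameter of the simplex is 2, hence C = 2 * L.
x_final, gaps = frank_wolfe_simplex(grad_f, x0, curvature=2.0 * L, num_iters=num_iters)

print("f(x0) =", f(x0), " f(x_final) =", f(x_final))
print("smallest gap seen:", min(gaps))
print("2L / sqrt(T)     :", 2.0 * L / math.sqrt(num_iters))
```

The quantity to watch is the smallest gap seen so far rather than the gap at the last iterate, since a rate of this kind bounds the best iterate. For this toy problem, a short descent-lemma argument bounds the smallest gap by 2L/√T, which the final two printed values let you compare.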

Taxonomy

Topics: Stochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Markov Chains and Monte Carlo Methods