Fast convergence of Frank-Wolfe algorithms on polytopes

Elias Wirth, Javier Pena, and Sebastian Pokutta

arXiv:2406.18789 · math.OC · May 21, 2025

Open Access

TL;DR

This paper introduces a unified template for deriving convergence rates of several Frank-Wolfe algorithms on polytopes from two affine-invariant properties of the problem: an error bound and extended curvature.

Contribution

It provides a template that links convergence rates to affine-invariant properties of the polytope and objective function. The template applies to multiple Frank-Wolfe variants on polytopes and does not depend on any choice of norm.

Findings

1. For each algorithm, the derived convergence rates range from sublinear to linear, depending on the degree of the error bound.

2. The rates depend solely on properties of the polytope and the objective function.

3. A single analysis covers vanilla Frank-Wolfe and its away-step, blended pairwise, and in-face variants.

Abstract

We provide a template to derive convergence rates for the following popular versions of the Frank-Wolfe algorithm on polytopes: vanilla Frank-Wolfe, Frank-Wolfe with away steps, Frank-Wolfe with blended pairwise steps, and Frank-Wolfe with in-face directions. Our template shows how the convergence rates follow from two affine-invariant properties of the problem, namely, error bound and extended curvature. These properties depend solely on the polytope and objective function but not on any affine-dependent object like norms. For each one of the above algorithms, we derive rates of convergence ranging from sublinear to linear depending on the degree of the error bound.
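
For readers unfamiliar with the base method, the vanilla Frank-Wolfe iteration named in the abstract can be sketched as follows. This is a minimal illustration and not the paper's template or analysis. The probability simplex as the polytope, the quadratic objective, the open-loop step size 2/(t+2), and all names (`frank_wolfe_simplex`, `b`, `x_hat`) are assumptions chosen for brevity.

```python
import numpy as np

def frank_wolfe_simplex(grad, x0, num_iters=200):
    """Vanilla Frank-Wolfe on the probability simplex (illustrative sketch)."""
    x = np.asarray(x0, dtype=float).copy()
    for t in range(num_iters):
        g = grad(x)
        # Linear minimization oracle: over the simplex, <g, v> is minimized
        # at the vertex (standard basis vector) with the smallest gradient entry.
        v = np.zeros_like(x)
        v[np.argmin(g)] = 1.0
        # Open-loop step size (an assumption; other step-size rules exist).
        gamma = 2.0 / (t + 2.0)
        # Convex combination of x and a vertex, so x stays in the polytope.
        x = x + gamma * (v - x)
    return x

# Hypothetical test problem: minimize 0.5 * ||x - b||^2 over the simplex.
b = np.array([0.2, 0.5, 0.3])
x_hat = frank_wolfe_simplex(lambda x: x - b, np.array([1.0, 0.0, 0.0]), num_iters=2000)
```

Each iterate is a convex combination of vertices, so feasibility holds without any projection step. The away-step, blended pairwise, and in-face variants listed in the abstract change how the search direction is chosen and are not shown here.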

Taxonomy

Topics: Matrix Theory and Algorithms · Stochastic Gradient Optimization Techniques · Advanced Optimization Algorithms Research