Frank-Wolfe with Subsampling Oracle

Thomas Kerdreux; Fabian Pedregosa; Alexandre d'Aspremont

arXiv:1803.07348·math.OC·March 21, 2018·1 cites

Frank-Wolfe with Subsampling Oracle

Thomas Kerdreux, Fabian Pedregosa, Alexandre d'Aspremont

PDF

Open Access

TL;DR

This paper introduces two randomized variants of the Frank-Wolfe algorithm that use subsampling to reduce computational costs, achieving comparable convergence rates and demonstrating efficiency in regression tasks with structured penalties.

Contribution

It presents the first provably convergent randomized variants of the Away-step Frank-Wolfe algorithm, combining subsampling with accelerated convergence.

Findings

01

Subsampling reduces the linear minimization cost significantly.

02

The randomized algorithms maintain convergence rates similar to deterministic versions.

03

Empirical results show computational gains in regression problems with structured penalties.

Abstract

We analyze two novel randomized variants of the Frank-Wolfe (FW) or conditional gradient algorithm. While classical FW algorithms require solving a linear minimization problem over the domain at each iteration, the proposed method only requires to solve a linear minimization problem over a small \emph{subset} of the original domain. The first algorithm that we propose is a randomized variant of the original FW algorithm and achieves a $O (1/ t)$ sublinear convergence rate as in the deterministic counterpart. The second algorithm is a randomized variant of the Away-step FW algorithm, and again as its deterministic counterpart, reaches linear (i.e., exponential) convergence rate making it the first provably convergent randomized variant of Away-step FW. In both cases, while subsampling reduces the convergence rate by a constant factor, the linear minimization step can be a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Machine Learning and Algorithms · Machine Learning and ELM