Enhancing Parameter-Free Frank Wolfe with an Extra Subproblem

Bingcong Li; Lingda Wang; Georgios B. Giannakis; Zhizhen Zhao

arXiv:2012.05284·math.OC·December 11, 2020·1 cites

Enhancing Parameter-Free Frank Wolfe with an Extra Subproblem

Bingcong Li, Lingda Wang, Georgios B. Giannakis, Zhizhen Zhao

PDF

Open Access 1 Video

TL;DR

This paper introduces ExtraFW, a parameter-free Frank Wolfe variant with a prediction-correction update, achieving optimal convergence rates and superior empirical performance in machine learning tasks like classification and matrix completion.

Contribution

The paper proposes ExtraFW, a novel parameter-free Frank Wolfe algorithm with a prediction-correction step, offering improved convergence rates and empirical results over existing methods.

Findings

01

Achieves ${ m O}(1/k)$ convergence rate for general convex problems.

02

Attains ${ m O}(1/k^2)$ rate on certain machine learning problems.

03

Outperforms standard FW and Nesterov's accelerated gradient in experiments.

Abstract

Aiming at convex optimization under structural constraints, this work introduces and analyzes a variant of the Frank Wolfe (FW) algorithm termed ExtraFW. The distinct feature of ExtraFW is the pair of gradients leveraged per iteration, thanks to which the decision variable is updated in a prediction-correction (PC) format. Relying on no problem dependent parameters in the step sizes, the convergence rate of ExtraFW for general convex problems is shown to be $O (\frac{1}{k})$ , which is optimal in the sense of matching the lower bound on the number of solved FW subproblems. However, the merit of ExtraFW is its faster rate $O (\frac{1}{k ^{2}})$ on a class of machine learning problems. Compared with other parameter-free FW variants that have faster rates on the same problems, ExtraFW has improved rates and fine-grained analysis thanks to its PC update. Numerical tests on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Enhancing Parameter-Free Frank Wolfe with an Extra Subproblem· underline

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Machine Learning and ELM

Methodspc