Enhancing Parameter-Free Frank Wolfe with an Extra Subproblem
Bingcong Li, Lingda Wang, Georgios B. Giannakis, Zhizhen Zhao

TL;DR
This paper introduces ExtraFW, a parameter-free Frank Wolfe variant with a prediction-correction update, achieving optimal convergence rates and superior empirical performance in machine learning tasks like classification and matrix completion.
Contribution
The paper proposes ExtraFW, a novel parameter-free Frank Wolfe algorithm with a prediction-correction step, offering improved convergence rates and empirical results over existing methods.
Findings
Achieves ${ m O}(1/k)$ convergence rate for general convex problems.
Attains ${ m O}(1/k^2)$ rate on certain machine learning problems.
Outperforms standard FW and Nesterov's accelerated gradient in experiments.
Abstract
Aiming at convex optimization under structural constraints, this work introduces and analyzes a variant of the Frank Wolfe (FW) algorithm termed ExtraFW. The distinct feature of ExtraFW is the pair of gradients leveraged per iteration, thanks to which the decision variable is updated in a prediction-correction (PC) format. Relying on no problem dependent parameters in the step sizes, the convergence rate of ExtraFW for general convex problems is shown to be , which is optimal in the sense of matching the lower bound on the number of solved FW subproblems. However, the merit of ExtraFW is its faster rate on a class of machine learning problems. Compared with other parameter-free FW variants that have faster rates on the same problems, ExtraFW has improved rates and fine-grained analysis thanks to its PC update. Numerical tests on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Machine Learning and ELM
Methodspc
