Projection-Free Algorithms in Statistical Estimation
Yan Li, Chao Qu, Huan Xu

TL;DR
This paper extends projection-free Frank-Wolfe algorithms to high-dimensional statistical estimation problems with non-strongly convex objectives, achieving logarithmic gradient evaluation complexity under restricted strong convexity.
Contribution
It demonstrates that FW-type algorithms can attain logarithmic gradient evaluation complexity in high-dimensional, non-strongly convex settings that satisfy restricted strong convexity, broadening their applicability.
Findings
Achieves $\log(\frac{1}{\epsilon})$ gradient complexity in certain non-convex settings.
Extends FW algorithms to high-dimensional, large-scale estimation problems.
Shows effectiveness under restricted strong convexity conditions.
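For readers unfamiliar with the projection-free idea, here is a minimal sketch of the classical Frank-Wolfe iteration on a sparse least-squares problem. It is not the variant analyzed in the paper: the function name `frank_wolfe_l1`, the least-squares objective, the l1-ball constraint, the step size 2/(t+2), and the synthetic data are all illustrative assumptions. The point is only that each step calls a linear minimization oracle (here, picking one signed vertex of the l1 ball) instead of computing a projection.

```python
import numpy as np

def frank_wolfe_l1(A, b, radius, n_iters=500):
    """Classical Frank-Wolfe for min (1/2n)||Ax - b||^2 s.t. ||x||_1 <= radius."""
    n, p = A.shape
    x = np.zeros(p)
    for t in range(n_iters):
        grad = A.T @ (A @ x - b) / n          # one full gradient evaluation
        # Linear minimization oracle over the l1 ball: the minimizer of
        # <grad, s> is a signed, scaled coordinate vector (no projection).
        i = np.argmax(np.abs(grad))
        s = np.zeros(p)
        s[i] = -radius * np.sign(grad[i])
        gamma = 2.0 / (t + 2.0)               # standard open-loop step size
        x = (1.0 - gamma) * x + gamma * s     # convex combination stays feasible
    return x

def objective(A, b, x):
    return 0.5 * np.sum((A @ x - b) ** 2) / A.shape[0]

# Hypothetical high-dimensional instance: more features than samples.
rng = np.random.default_rng(0)
n, p = 50, 200
A = rng.standard_normal((n, p))
x_true = np.zeros(p)
x_true[:3] = [1.0, -1.0, 0.5]
b = A @ x_true
radius = np.abs(x_true).sum()

x_hat = frank_wolfe_l1(A, b, radius)
f_start = objective(A, b, np.zeros(p))
f_end = objective(A, b, x_hat)
```

Because every iterate is a convex combination of points in the l1 ball, feasibility holds without any projection step. This plain version converges only sublinearly; the logarithmic gradient complexity discussed here relies on the more refined FW-type algorithms and the restricted strong convexity assumption from the paper.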
Abstract
The Frank-Wolfe algorithm (FW) and its variants have attracted a surge of interest in the machine learning community due to their projection-free property. Recently, the gradient evaluation complexity of the FW algorithm has been reduced to $\log(\frac{1}{\epsilon})$ for smooth and strongly convex objectives. This complexity result is especially significant in learning problems, as the overwhelming data size makes a single evaluation of the gradient computationally expensive. However, in high-dimensional statistical estimation problems, the objective is typically not strongly convex, and sometimes even non-convex. In this paper, we extend the state-of-the-art FW-type algorithms to large-scale, high-dimensional estimation problems. We show that as long as the objective satisfies restricted strong convexity, and we are not optimizing over the statistical limit of the model, the…
Taxonomy
Topics: Stochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Machine Learning and Algorithms
