An acceleration procedure for optimal first-order methods
Michel Baes, Michael Buergisser

TL;DR
This paper presents an improved optimal first-order method that efficiently estimates the local Lipschitz constant, enabling faster convergence and significant computational savings in large-scale eigenvalue minimization problems.
Contribution
The authors introduce a novel acceleration procedure that allows setting the Lipschitz constant optimally at each iteration with maintained worst-case complexity.
Findings
Reduces computation time by up to three orders of magnitude on large problems.
Provides an adaptable scheme for smoothing techniques.
Maintains optimal worst-case complexity despite practical efficiency improvements.
Abstract
We introduce in this paper an optimal first-order method that allows an easy and cheap evaluation of the local Lipschitz constant of the objective's gradient. This constant must ideally be chosen at every iteration as small as possible, while serving in an indispensable upper bound for the value of the objective function. In the previously existing variants of optimal first-order methods, this upper bound inequality was constructed from points computed during the current iteration. It was thus not possible to select the optimal value for this Lipschitz constant at the beginning of the iteration. In our variant, the upper bound inequality is constructed from points available before the current iteration, offering us the possibility to set the Lipschitz constant to its optimal value at once. This procedure, even if efficient in practice, presents a higher worse-case complexity than…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Optimization Algorithms Research · Sparse and Compressive Sensing Techniques · Numerical methods in inverse problems
