Adaptive Third-Order Methods for Composite Convex Optimization
Geovani Nunes Grapiglia, Yurii Nesterov

TL;DR
This paper introduces adaptive third-order methods for composite convex optimization that do not require prior knowledge of Lipschitz constants, achieving improved iteration complexity for finding approximate minimizers.
Contribution
The paper develops novel adaptive third-order optimization algorithms with dynamic regularization parameter adjustment, enhancing efficiency over existing high-order methods.
Findings
Basic method finds an ε-approximate minimizer with O(|log(ε)| ε^{-1/3}) inner iterations.
Accelerated method reduces total inner iterations to O(|log(ε)| ε^{-1/4}).
Methods do not require prior Lipschitz constant knowledge.
Abstract
In this paper we propose third-order methods for composite convex optimization problems in which the smooth part is a three-times continuously differentiable function with Lipschitz continuous third-order derivatives. The methods are adaptive in the sense that they do not require the knowledge of the Lipschitz constant. Trial points are computed by the inexact minimization of models that consist in the nonsmooth part of the objective plus a quartic regularization of third-order Taylor polynomial of the smooth part. Specifically, approximate solutions of the auxiliary problems are obtained by using a Bregman gradient method as inner solver. Different from existing adaptive approaches for high-order methods, in our new schemes the regularization parameters are adjusted taking into account the progress of the inner solver. With this technique, we show that the basic method finds an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Advanced Optimization Algorithms Research · Stochastic Gradient Optimization Techniques
