Super-Universal Regularized Newton Method
Nikita Doikov, Konstantin Mishchenko, Yurii Nesterov

TL;DR
This paper introduces an adaptive regularized Newton method that automatically adjusts to different problem classes, achieving optimal convergence rates including superlinear convergence without prior parameter knowledge.
Contribution
It presents a universal regularized Newton method with adaptive parameter tuning, achieving optimal convergence rates across various problem classes without prior knowledge of problem parameters.
Findings
Achieves $O(1/k^3)$ rate for functions with Lipschitz third derivative.
Automatically accelerates for uniformly convex functions, with superlinear convergence.
No prior knowledge of problem parameters is needed for rate adaptation.
Abstract
We analyze the performance of a variant of Newton method with quadratic regularization for solving composite convex minimization problems. At each step of our method, we choose regularization parameter proportional to a certain power of the gradient norm at the current point. We introduce a family of problem classes characterized by H\"older continuity of either the second or third derivative. Then we present the method with a simple adaptive search procedure allowing an automatic adjustment to the problem class with the best global complexity bounds, without knowing specific parameters of the problem. In particular, for the class of functions with Lipschitz continuous third derivative, we get the global rate, which was previously attributed to third-order tensor methods. When the objective function is uniformly convex, we justify an automatic acceleration of our scheme,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Optimization Algorithms Research · Sparse and Compressive Sensing Techniques · Optimization and Variational Analysis
