Gradient Regularization of Newton Method with Bregman Distances
Nikita Doikov, Yurii Nesterov

TL;DR
This paper introduces a non-Euclidean Newton method with Bregman distances, achieving improved convergence rates for composite optimization problems, including adaptive and accelerated variants.
Contribution
It develops a first second-order scheme using Bregman distances with proven convergence rates, relaxing cubic regularization while maintaining efficiency.
Findings
Global convergence rate of O(k^{-2}) for the basic scheme
Linear convergence for uniformly convex functions of degree three
Accelerated scheme with convergence rate O(k^{-3})
Abstract
In this paper, we propose a first second-order scheme based on arbitrary non-Euclidean norms, incorporated by Bregman distances. They are introduced directly in the Newton iterate with regularization parameter proportional to the square root of the norm of the current gradient. For the basic scheme, as applied to the composite optimization problem, we establish the global convergence rate of the order both in terms of the functional residual and in the norm of subgradients. Our main assumption on the smooth part of the objective is Lipschitz continuity of its Hessian. For uniformly convex functions of degree three, we justify global linear rate, and for strongly convex function we prove the local superlinear rate of convergence. Our approach can be seen as a relaxation of the Cubic Regularization of the Newton method, which preserves its convergence properties, while the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Optimization Algorithms Research · Sparse and Compressive Sensing Techniques · Optimization and Variational Analysis
