Gradient Regularization of Newton Method with Bregman Distances

Nikita Doikov; Yurii Nesterov

arXiv:2112.02952 · math.OC · December 7, 2021 · Math. Program.

TL;DR

This paper introduces a non-Euclidean Newton method regularized with Bregman distances for composite optimization problems. The basic scheme has a global convergence rate of O(k^{-2}), and the paper also covers adaptive and accelerated variants.

Contribution

The paper develops a first second-order scheme based on Bregman distances with proven convergence rates. The scheme can be seen as a relaxation of the Cubic Regularization of the Newton method that preserves its convergence properties.

Findings

01

Global convergence rate of O(k^{-2}) for the basic scheme

02

Global linear convergence for uniformly convex functions of degree three

03

Accelerated scheme with convergence rate O(k^{-3})

Abstract

In this paper, we propose a first second-order scheme based on arbitrary non-Euclidean norms, incorporated by Bregman distances. They are introduced directly in the Newton iterate, with a regularization parameter proportional to the square root of the norm of the current gradient. For the basic scheme, as applied to the composite optimization problem, we establish a global convergence rate of the order $O(k^{-2})$, both in terms of the functional residual and in the norm of subgradients. Our main assumption on the smooth part of the objective is Lipschitz continuity of its Hessian. For uniformly convex functions of degree three, we justify a global linear rate, and for strongly convex functions we prove a local superlinear rate of convergence. Our approach can be seen as a relaxation of the Cubic Regularization of the Newton method, which preserves its convergence properties, while the…
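The core idea in the abstract, a Newton step whose regularization parameter scales with the square root of the current gradient norm, can be illustrated with a minimal sketch. This is a simplified Euclidean version only: it replaces the Bregman distance with the squared Euclidean norm and drops the composite part, so it is not the paper's full scheme. The function name `grad_reg_newton`, the constant `c`, and the regularized logistic-type test problem are all assumptions made for illustration; in practice `c` would presumably be tied to the Lipschitz constant of the Hessian.

```python
import numpy as np

def grad_reg_newton(grad, hess, x0, c=1.0, tol=1e-10, max_iter=100):
    """Euclidean sketch of a gradient-regularized Newton iteration.

    Each step solves (H + lam * I) d = g with lam = sqrt(c * ||g||).
    The constant c is a hypothetical tuning parameter, not a value
    taken from the paper.
    """
    x = np.asarray(x0, dtype=float)
    for _ in range(max_iter):
        g = grad(x)
        g_norm = np.linalg.norm(g)
        if g_norm <= tol:
            break
        lam = np.sqrt(c * g_norm)  # regularization ~ sqrt of gradient norm
        x = x - np.linalg.solve(hess(x) + lam * np.eye(x.size), g)
    return x

# Hypothetical test problem: f(x) = sum_i log(1 + exp(a_i^T x)) + (mu/2) ||x||^2,
# a strongly convex function whose Hessian is Lipschitz continuous.
rng = np.random.default_rng(0)
A = rng.standard_normal((20, 5)) / np.sqrt(20)
mu = 0.1

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def grad(x):
    return A.T @ sigmoid(A @ x) + mu * x

def hess(x):
    s = sigmoid(A @ x)
    weights = s * (1.0 - s)  # second derivative of log(1 + exp(z))
    return A.T @ (weights[:, None] * A) + mu * np.eye(x.size)

x_star = grad_reg_newton(grad, hess, np.zeros(5))
print("final gradient norm:", np.linalg.norm(grad(x_star)))
```

Because the regularization vanishes as the gradient does, the step approaches a pure Newton step near the solution, which is consistent with the local superlinear rate the abstract claims for strongly convex functions.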

Taxonomy

Topics: Advanced Optimization Algorithms Research · Sparse and Compressive Sensing Techniques · Optimization and Variational Analysis