Adaptive Regularized Newton Method with Inexact Hessian
Aleksandr Shestakov, Nail Bashirov, Andrei Semenov, Alexander Gasnikov, Martin Tak\'a\v{c}, Aleksandr Beznosikov, Dmitry Kamzolov

TL;DR
This paper introduces an adaptive regularized Newton method that uses inexact Hessians and Bregman divergences, achieving global convergence and reduced computational costs for both convex and nonconvex optimization problems.
Contribution
It proposes a flexible Newton-type algorithm that incorporates inexact Hessians and Bregman divergences, improving efficiency and applicability in machine learning tasks.
Findings
Method converges globally for convex and nonconvex problems.
Achieves theoretical convergence rates comparable to existing methods.
Demonstrates competitive performance in experiments.
Abstract
Newton's method is the most widespread high-order method, demanding the gradient and the Hessian of the objective function. However, one of the main disadvantages of Newtons method is its lack of global convergence and high iteration cost. Both these drawbacks are critical for modern optimization motivated primarily by current applications in machine learning. In this paper, we introduce a novel algorithm to deal with these disadvantages. Our method can be implemented with various Hessian approximations, including methods that use only the first-order information. Thus, computational costs might be drastically reduced. Also, it can be adjusted to problems' geometries via the usage of different Bregman divergences. The proposed method converges for nonconvex and convex problems globally and it has the same rates as other well-known methods that lack mentioned properties. We present…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Advanced Optimization Algorithms Research · Sparse and Compressive Sensing Techniques
