Adaptive Multilevel Newton: A Quadratically Convergent Optimization Method
Nick Tsipinakis, Panos Parpas, Matthias Voigt

TL;DR
This paper presents an adaptive multilevel Newton method that automatically switches to full Newton steps to achieve quadratic convergence, outperforming traditional methods in strongly convex optimization problems.
Contribution
It introduces an adaptive switching strategy in multilevel Newton methods, ensuring quadratic convergence and improved efficiency over classical approaches.
Findings
The method achieves local quadratic convergence for strongly convex functions.
Empirical results show the method outperforms Newton's and Gradient Descent methods.
The approach is effective despite higher per-iteration costs.
Abstract
Newton's method may exhibit slower convergence than vanilla Gradient Descent in its initial phase on strongly convex problems. Classical Newton-type multilevel methods mitigate this but, like Gradient Descent, achieve only linear convergence near the minimizer. We introduce an adaptive multilevel Newton-type method with a principled automatic switch to full Newton once its quadratic phase is reached. The local quadratic convergence for strongly convex functions with Lipschitz continuous Hessians and for self-concordant functions is established and confirmed empirically. Although per-iteration cost can exceed that of classical multilevel schemes, the method is efficient and consistently outperforms Newton's method, Gradient Descent, and the multilevel Newton method, indicating that second-order methods can outperform first-order methods even when Newton's method is initially slow. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
