Faster Newton Methods for Convex and Nonconvex Optimization in Gradient Complexity

Lesi Chen; Chengchang Liu; Luo Luo; Jingzhao Zhang

arXiv:2501.17488·math.OC·January 26, 2026

Faster Newton Methods for Convex and Nonconvex Optimization in Gradient Complexity

Lesi Chen, Chengchang Liu, Luo Luo, Jingzhao Zhang

PDF

Open Access

TL;DR

This paper introduces faster second-order optimization methods that significantly reduce gradient complexity for large-scale convex and nonconvex problems, advancing the state-of-the-art in efficiency.

Contribution

The authors propose new methods that improve gradient complexity bounds for both convex and nonconvex optimization, surpassing recent results in the field.

Findings

01

Achieved gradient complexity of O(d + d^{1/3} ε^{-3/2}) for nonconvex optimization.

02

Achieved gradient complexity of O((d + d^{13/21} ε^{-2/7}) log d) for convex optimization.

03

Improved the theoretical bounds compared to previous methods.

Abstract

Second-order optimization methods are computationally expensive for large-scale problems. Recently, Doikov, Chayti, and Jaggi (ICML 2023) proposed the LazyCRN method that reduces computation by studying the gradient complexity of second-order methods. Their method can achieve a gradient complexity of $O (d + d^{1/2} ϵ^{- 3/2})$ and $O (d + d^{1/2} ϵ^{- 1/2})$ for nonconvex and convex optimization, respectively, where $d$ is the effective dimension and $ϵ$ is the target precision. Very recently, Adil, Bullins, Sidford, and Zhang (NeurIPS 2025) improved the gradient complexity to $O (d + d^{1/3} ϵ^{- 3/2} ln^{18} ϵ^{- 1})$ for nonconvex optimization. However, the tightness of these methods remains open. In this work, we propose new methods that achieve an improved complexity of $O (d + d^{1/3} ϵ^{- 3/2})$ and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsModel Reduction and Neural Networks · Numerical Methods and Algorithms · Matrix Theory and Algorithms