Faster Newton Methods for Convex and Nonconvex Optimization in Gradient Complexity
Lesi Chen, Chengchang Liu, Luo Luo, Jingzhao Zhang

TL;DR
This paper introduces faster second-order optimization methods that significantly reduce gradient complexity for large-scale convex and nonconvex problems, advancing the state-of-the-art in efficiency.
Contribution
The authors propose new methods that improve gradient complexity bounds for both convex and nonconvex optimization, surpassing recent results in the field.
Findings
Achieved gradient complexity of O(d + d^{1/3} ε^{-3/2}) for nonconvex optimization.
Achieved gradient complexity of O((d + d^{13/21} ε^{-2/7}) log d) for convex optimization.
Improved the theoretical bounds compared to previous methods.
Abstract
Second-order optimization methods are computationally expensive for large-scale problems. Recently, Doikov, Chayti, and Jaggi (ICML 2023) proposed the LazyCRN method that reduces computation by studying the gradient complexity of second-order methods. Their method can achieve a gradient complexity of and for nonconvex and convex optimization, respectively, where is the effective dimension and is the target precision. Very recently, Adil, Bullins, Sidford, and Zhang (NeurIPS 2025) improved the gradient complexity to for nonconvex optimization. However, the tightness of these methods remains open. In this work, we propose new methods that achieve an improved complexity of and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Numerical Methods and Algorithms · Matrix Theory and Algorithms
