Adaptive Newton-CG methods with global and local analysis for unconstrained optimization with H\"older continuous Hessian
Ziyang Zeng, Junyu Zhang, Chuan He

TL;DR
This paper introduces adaptive Newton-CG algorithms for nonconvex optimization with Hölder continuous Hessians, improving efficiency and convergence over existing methods by eliminating nested line searches.
Contribution
The paper proposes two novel adaptive Newton-CG algorithms that leverage auto-conditioning to improve iteration complexity and achieve local superlinear convergence.
Findings
Achieve optimal iteration complexity ${\\mathcal O}(H_f^{1/(1+\nu)}\epsilon^{-(2+\nu)/(1+\nu)})$.
Eliminate the nested line search to enhance computational efficiency.
Demonstrate practical advantages through numerical experiments.
Abstract
In this paper, we study Newton-conjugate gradient (Newton-CG) methods for minimizing a nonconvex function whose Hessian is -H\"older continuous with modulus and exponent . Recently proposed Newton-CG methods for this problem adopt (i) non-adaptive regularization and (ii) a nested line-search procedure, where (i) often leads to inefficient early progress and the loss of local superlinear convergence, and (ii) may incur high computational cost due to multiple solves of the Newton system per iteration. To address these limitations, we propose two novel Newton-CG algorithms, depending on the availability of , that adaptively regularize the Newton system by leveraging the auto-conditioning technique to eliminate the nested line search. The proposed algorithms achieve the best-known iteration complexity ${\mathcal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
