Scalable adaptive cubic regularization methods
Jean-Pierre Dussault, Dominique Orban

TL;DR
This paper introduces ARCqK, a scalable adaptive cubic regularization method that efficiently solves large-scale unconstrained optimization problems by concurrently solving multiple shifted systems using a modified Lanczos-based conjugate gradient approach.
Contribution
The paper presents a novel scalable implementation of ARC that handles multiple shift parameters simultaneously with minimal overhead, enabling large-scale problem solving.
Findings
ARCqK outperforms classic trust-region methods on large problems.
The method requires only one Hessian-vector product per iteration.
Theoretical analysis confirms convergence and complexity bounds.
Abstract
Adaptive cubic regularization (ARC) methods for unconstrained optimization compute steps from linear systems involving a shifted Hessian in the spirit of the Levenberg-Marquardt and trust-region methods. The standard approach consists in performing an iterative search for the shift akin to solving the secular equation in trust-region methods. Such search requires computing the Cholesky factorization of a tentative shifted Hessian at each iteration, which limits the size of problems that can be reasonably considered. We propose a scalable implementation of ARC named ARCqK in which we solve a set of shifted systems concurrently by way of an appropriate modification of the Lanczos formulation of the conjugate gradient (CG) method. At each iteration of ARCqK to solve a problem with n variables, a range of m << n shift parameters is selected. The computational overhead in CG beyond the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
