Scalable adaptive cubic regularization methods

Jean-Pierre Dussault; Dominique Orban

arXiv:2103.16659·math.OC·April 1, 2021·Math. Program.

Scalable adaptive cubic regularization methods

Jean-Pierre Dussault, Dominique Orban

PDF

TL;DR

This paper introduces ARCqK, a scalable adaptive cubic regularization method that efficiently solves large-scale unconstrained optimization problems by concurrently solving multiple shifted systems using a modified Lanczos-based conjugate gradient approach.

Contribution

The paper presents a novel scalable implementation of ARC that handles multiple shift parameters simultaneously with minimal overhead, enabling large-scale problem solving.

Findings

01

ARCqK outperforms classic trust-region methods on large problems.

02

The method requires only one Hessian-vector product per iteration.

03

Theoretical analysis confirms convergence and complexity bounds.

Abstract

Adaptive cubic regularization (ARC) methods for unconstrained optimization compute steps from linear systems involving a shifted Hessian in the spirit of the Levenberg-Marquardt and trust-region methods. The standard approach consists in performing an iterative search for the shift akin to solving the secular equation in trust-region methods. Such search requires computing the Cholesky factorization of a tentative shifted Hessian at each iteration, which limits the size of problems that can be reasonably considered. We propose a scalable implementation of ARC named ARCqK in which we solve a set of shifted systems concurrently by way of an appropriate modification of the Lanczos formulation of the conjugate gradient (CG) method. At each iteration of ARCqK to solve a problem with n variables, a range of m << n shift parameters is selected. The computational overhead in CG beyond the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.