A Stochastic Objective-Function-Free Adaptive Regularization Method with   Optimal Complexity

Serge Gratton; Sadok Jerad; Philippe L. Toint

arXiv:2407.08018·math.OC·January 22, 2025·Open J. Math. Optim.

A Stochastic Objective-Function-Free Adaptive Regularization Method with Optimal Complexity

Serge Gratton, Sadok Jerad, Philippe L. Toint

PDF

Open Access

TL;DR

This paper introduces a stochastic second-order adaptive-regularization method that avoids computing the objective function, yet guarantees optimal complexity for finding critical points, with applications to inexact derivatives and finite-sum minimization.

Contribution

It presents a novel objective-function-free adaptive regularization method with optimal complexity bounds for nonconvex optimization, tolerant to noise and inexact evaluations.

Findings

01

Achieves $ ilde{O}( ext{epsilon}^{-3/2})$ complexity for critical points.

02

Demonstrates effectiveness on large binary classification problems.

03

Handles inexact derivatives and finite-sum minimization through sampling.

Abstract

A fully stochastic second-order adaptive-regularization method for unconstrained nonconvex optimization is presented which never computes the objective-function value, but yet achieves the optimal $O (ϵ^{- 3/2})$ complexity bound for finding first-order critical points. The method is noise-tolerant and the inexactness conditions required for convergence depend on the history of past steps. Applications to cases where derivative evaluation is inexact and to minimization of finite sums by sampling are discussed. Numerical experiments on large binary classification problems illustrate the potential of the new method.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSparse and Compressive Sensing Techniques · Numerical methods in inverse problems · Neural Networks and Applications