A Stochastic Objective-Function-Free Adaptive Regularization Method with Optimal Complexity
Serge Gratton, Sadok Jerad, Philippe L. Toint

TL;DR
This paper introduces a stochastic second-order adaptive-regularization method that avoids computing the objective function, yet guarantees optimal complexity for finding critical points, with applications to inexact derivatives and finite-sum minimization.
Contribution
It presents a novel objective-function-free adaptive regularization method with optimal complexity bounds for nonconvex optimization, tolerant to noise and inexact evaluations.
Findings
Achieves $ ilde{O}( ext{epsilon}^{-3/2})$ complexity for critical points.
Demonstrates effectiveness on large binary classification problems.
Handles inexact derivatives and finite-sum minimization through sampling.
Abstract
A fully stochastic second-order adaptive-regularization method for unconstrained nonconvex optimization is presented which never computes the objective-function value, but yet achieves the optimal complexity bound for finding first-order critical points. The method is noise-tolerant and the inexactness conditions required for convergence depend on the history of past steps. Applications to cases where derivative evaluation is inexact and to minimization of finite sums by sampling are discussed. Numerical experiments on large binary classification problems illustrate the potential of the new method.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Numerical methods in inverse problems · Neural Networks and Applications
