First- and Second-Order Stochastic Adaptive Regularization with Cubics:   High Probability Iteration and Sample Complexity

Katya Scheinberg; Miaolan Xie

arXiv:2308.13161·math.OC·April 23, 2025·1 cites

First- and Second-Order Stochastic Adaptive Regularization with Cubics: High Probability Iteration and Sample Complexity

Katya Scheinberg, Miaolan Xie

PDF

Open Access

TL;DR

This paper establishes high-probability iteration and sample complexity bounds for stochastic adaptive regularization with cubics, improving understanding of their efficiency in finding first- and second-order stationary points.

Contribution

It provides the first high-probability complexity bounds for stochastic adaptive regularization methods with cubics targeting both first- and second-order optimality.

Findings

01

High-probability bounds outperform other stochastic methods.

02

Applicable to various optimization settings including risk minimization.

03

Demonstrates efficiency of SARC methods in stochastic settings.

Abstract

We present high-probability (and expectation) complexity bounds for two versions of stochastic adaptive regularization methods with cubics (SARC), also known as regularized Newton methods. The first algorithm aims to find first-order stationary points, while the second targets second-order optimality conditions. Both methods employ stochastic zeroth-, first-, and second-order oracles with specific accuracy and reliability requirements. These oracles, which have been previously used with other stochastic adaptive methods like trust-region and line-search algorithms, are applicable to various optimization settings including expected risk minimization and simulation optimization. In this paper, we establish the first high-probability iteration and sample complexity bounds for both first- and second-order SARC algorithms. Our analysis demonstrates that as in the deterministic case, they…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Statistical Methods and Inference