Negative Curvature Methods with High-Probability Complexity Guarantees for Stochastic Nonconvex Optimization
Albert S. Berahas, Raghu Bollapragada, Wanping Dong

TL;DR
This paper introduces a probabilistic framework for negative curvature methods in stochastic nonconvex optimization, providing high-probability complexity guarantees and demonstrating practical benefits through experiments.
Contribution
It develops a novel two-step framework combining gradient and negative curvature steps with probabilistic oracles, achieving high-probability convergence guarantees.
Findings
High-probability iteration complexity bounds are established.
The framework recovers deterministic rates in the noiseless case.
Numerical experiments show practical advantages of negative curvature directions.
Abstract
This paper develops negative curvature methods for continuous nonlinear unconstrained optimization in stochastic settings, in which function, gradient, and Hessian information is available only through probabilistic oracles, i.e., oracles that return approximations of a certain accuracy and reliability. We introduce conditions on these oracles and design a two-step framework that systematically combines gradient and negative curvature steps. The framework employs an early-stopping mechanism to guarantee sufficient progress and uses an adaptive mechanism based on an Armijo-type criterion to select the step sizes for both steps. We establish high-probability iteration-complexity guarantees for attaining second-order stationary points, deriving explicit tail bounds that quantify the convergence neighborhood and its dependence on oracle noise. Importantly, these bounds match deterministic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Risk and Portfolio Optimization · Optimization and Variational Analysis
