A Subsampling Line-Search Method with Second-Order Results
El-houcine Bergou, Youssef Diouane, Vladimir Kunc, Vyacheslav, Kungurtsev, Cl\'ement W. Royer

TL;DR
This paper introduces a stochastic line-search optimization method using subsampling and second-order information, providing convergence guarantees and practical efficiency for large-scale nonconvex problems like deep learning.
Contribution
It develops a subsampling-based second-order line-search algorithm with probabilistic guarantees, addressing challenges in nonconvex stochastic optimization.
Findings
Method achieves second-order stationarity efficiently.
Provides worst-case complexity guarantees.
Performs competitively with first-order methods in practice.
Abstract
In many contemporary optimization problems such as those arising in machine learning, it can be computationally challenging or even infeasible to evaluate an entire function or its derivatives. This motivates the use of stochastic algorithms that sample problem data, which can jeopardize the guarantees obtained through classical globalization techniques in optimization such as a trust region or a line search. Using subsampled function values is particularly challenging for the latter strategy, which relies upon multiple evaluations. On top of that all, there has been an increasing interest for nonconvex formulations of data-related problems, such as training deep learning models. For such instances, one aims at developing methods that converge to second-order stationary points quickly, i.e., escape saddle points efficiently. This is particularly delicate to ensure when one only accesses…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · 3D Shape Modeling and Analysis
