Balancing Gradient and Hessian Queries in Non-Convex Optimization
Deeksha Adil, Brian Bullins, Aaron Sidford, Chenyi Zhang

TL;DR
This paper introduces new optimization algorithms that balance the number of gradient and Hessian computations to efficiently find critical points in non-convex functions, improving query complexities under various conditions.
Contribution
It presents a novel method that offers flexible trade-offs between gradient and Hessian queries for non-convex optimization, including bounds with approximate Hessian computations.
Findings
Achieves improved gradient query complexity for bounded dimension cases.
Provides a single Hessian query method with competitive performance.
Develops a general algorithm handling approximate Hessian computations.
Abstract
We develop optimization methods which offer new trade-offs between the number of gradient and Hessian computations needed to compute the critical point of a non-convex function. We provide a method that for any twice-differentiable with -Lipschitz Hessian, input initial point with -bounded sub-optimality, and sufficiently small , outputs an -critical point, i.e., a point such that , using queries to a gradient oracle and queries to a Hessian oracle for any positive integer . As a consequence, we obtain an improved gradient query complexity of in the case of bounded dimension and of in the case where we are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Topological and Geometric Data Analysis · Markov Chains and Monte Carlo Methods
