Saving Gradient and Negative Curvature Computations: Finding Local   Minima More Efficiently

Yaodong Yu; Difan Zou; Quanquan Gu

arXiv:1712.03950·cs.LG·December 12, 2017·6 cites

Saving Gradient and Negative Curvature Computations: Finding Local Minima More Efficiently

Yaodong Yu, Difan Zou, Quanquan Gu

PDF

Open Access

TL;DR

This paper introduces a family of nonconvex optimization algorithms that efficiently find local minima by reducing gradient and negative curvature computations, improving runtime over existing methods.

Contribution

The algorithms divide the domain into large and small gradient regions, performing targeted descent steps, and can escape small gradient regions with minimal negative curvature computations.

Findings

01

Can escape small gradient regions in one negative curvature step

02

Potentially outperform state-of-the-art local minima algorithms

03

Effective in both deterministic and stochastic settings

Abstract

We propose a family of nonconvex optimization algorithms that are able to save gradient and negative curvature computations to a large extent, and are guaranteed to find an approximate local minimum with improved runtime complexity. At the core of our algorithms is the division of the entire domain of the objective function into small and large gradient regions: our algorithms only perform gradient descent based procedure in the large gradient region, and only perform negative curvature descent in the small gradient region. Our novel analysis shows that the proposed algorithms can escape the small gradient region in only one negative curvature descent step whenever they enter it, and thus they only need to perform at most $N_{ϵ}$ negative curvature direction computations, where $N_{ϵ}$ is the number of times the algorithms enter small gradient regions. For both…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Computational Geometry and Mesh Generation