Adaptive Sampling Strategies for Stochastic Optimization

Raghu Bollapragada; Richard Byrd; Jorge Nocedal

arXiv:1710.11258·math.OC·November 1, 2017·SIAM J. Optim.

Adaptive Sampling Strategies for Stochastic Optimization

Raghu Bollapragada, Richard Byrd, Jorge Nocedal

PDF

TL;DR

This paper introduces an adaptive stochastic optimization method that dynamically adjusts sample sizes based on an inner product test, achieving global convergence and efficient variance reduction without full gradient computations.

Contribution

The paper presents a novel adaptive sampling strategy using an inner product test, improving variance control and convergence guarantees over existing methods.

Findings

01

The method ensures descent directions with high probability.

02

It achieves global convergence on nonconvex functions.

03

It attains a linear convergence rate on strongly convex functions.

Abstract

In this paper, we propose a stochastic optimization method that adaptively controls the sample size used in the computation of gradient approximations. Unlike other variance reduction techniques that either require additional storage or the regular computation of full gradients, the proposed method reduces variance by increasing the sample size as needed. The decision to increase the sample size is governed by an inner product test that ensures that search directions are descent directions with high probability. We show that the inner product test improves upon the well known norm test, and can be used as a basis for an algorithm that is globally convergent on nonconvex functions and enjoys a global linear rate of convergence on strongly convex functions. Numerical experiments on logistic regression problems illustrate the performance of the algorithm.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsLogistic Regression