Mini-batch stochastic gradient descent with dynamic sample sizes

Michael R. Metel

arXiv:1708.00555 · math.OC · August 3, 2017 · 5 cites

Open Access

TL;DR

This paper introduces dynamic sample size rules for mini-batch stochastic gradient descent on constrained convex optimization problems. The rules ensure a descent direction with high probability, and empirical results from two applications show superior convergence compared to fixed sample implementations.

Contribution

The paper proposes dynamic sample size rules for mini-batch SGD that ensure a descent direction with high probability.

Findings

01 Superior convergence compared to fixed sample implementations

02 Effective in constrained convex optimization problems

03 Empirical validation on two applications

Abstract

We focus on solving constrained convex optimization problems using mini-batch stochastic gradient descent. Dynamic sample size rules are presented which ensure a descent direction with high probability. Empirical results from two applications show superior convergence compared to fixed sample implementations.
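
The abstract does not state the sample size rules themselves, so the following is only a minimal sketch of the general idea and not the paper's method. It assumes a least-squares objective, a box constraint handled by projection, and a simple variance-based test that doubles the mini-batch when the sampled gradient looks too noisy relative to its norm. The names `dynamic_batch_sgd`, `project_box`, `theta`, and `batch0` are hypothetical and chosen for illustration.

```python
import numpy as np


def project_box(x, lo, hi):
    """Euclidean projection onto the box {x : lo <= x <= hi}."""
    return np.clip(x, lo, hi)


def dynamic_batch_sgd(A, b, lo, hi, step=0.1, batch0=4, theta=0.5,
                      iters=200, seed=0):
    """Projected mini-batch SGD on f(x) = 1/(2n) * sum_i (a_i^T x - b_i)^2.

    ASSUMPTION: the growth rule below is a generic variance heuristic,
    not the rule derived in the paper. The batch size doubles whenever
    the estimated variance of the mini-batch gradient exceeds
    theta^2 * ||g||^2.
    """
    rng = np.random.default_rng(seed)
    n, d = A.shape
    x = project_box(np.zeros(d), lo, hi)
    batch = min(batch0, n)
    sizes = []
    for _ in range(iters):
        idx = rng.choice(n, size=batch, replace=False)
        residual = A[idx] @ x - b[idx]
        per_sample = A[idx] * residual[:, None]  # one gradient per row
        g = per_sample.mean(axis=0)              # mini-batch gradient
        if 1 < batch < n:
            # Estimated variance of the averaged gradient.
            var = per_sample.var(axis=0, ddof=1).sum() / batch
            if var > theta ** 2 * (g @ g):
                batch = min(2 * batch, n)        # used from the next step on
        x = project_box(x - step * g, lo, hi)
        sizes.append(batch)
    return x, sizes


def objective(A, b, x):
    """Full least-squares objective, used only to monitor progress."""
    r = A @ x - b
    return 0.5 * np.mean(r ** 2)


# Small synthetic problem (illustrative data only).
data_rng = np.random.default_rng(1)
A = data_rng.standard_normal((500, 5))
x_true = np.array([0.5, -0.3, 0.8, 0.0, -0.6])
b = A @ x_true + 0.01 * data_rng.standard_normal(500)

x_hat, sizes = dynamic_batch_sgd(A, b, lo=-1.0, hi=1.0)
print("batch size went from", sizes[0], "to", sizes[-1])
print("objective:", objective(A, b, np.zeros(5)), "->", objective(A, b, x_hat))
```

In this sketch the batch size can only grow, which keeps early iterations cheap and spends more samples once the gradient becomes small relative to its noise. A fixed sample implementation corresponds to skipping the variance test and keeping `batch` constant.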

Taxonomy

Topics: Stochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Machine Learning and Algorithms