On the boundedness of the sequence generated by minibatch stochastic gradient descent

Heinz H. Bauschke; Tran Thanh Tung

arXiv:2506.23303·math.OC·July 1, 2025

Open Access

TL;DR

This paper investigates the conditions under which the sequence generated by minibatch stochastic gradient descent remains bounded, extending previous results to broader classes of functions including coercive functions.

Contribution

The paper extends the boundedness result for SGD iterates under a decreasing Polyak stepsize from the previously known strongly convex case to a broader class of objective functions that includes coercive functions.

Findings

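01. Boundedness of the iterates holds for a broader class of objective functions, including coercive functions.

02. A case is presented in which boundedness may or may not hold.

03. The boundedness assumption underlying the convergence of the decreasing Polyak stepsize variant is shown to hold beyond the strongly convex setting.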

Abstract

Stochastic Gradient Descent (SGD) with Polyak's stepsize has recently gained renewed attention in stochastic optimization. Recently, Orvieto, Lacoste-Julien, and Loizou introduced a decreasing variant of Polyak's stepsize, where convergence relies on a boundedness assumption of the iterates. They established that this assumption holds under strong convexity. In this paper, we extend their result by proving that boundedness also holds for a broader class of objective functions, including coercive functions. We also present a case in which boundedness may or may not hold.
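As a rough illustration of the setting the abstract describes, the following sketch runs minibatch SGD with a decreasing Polyak-type stepsize on a scalar least-squares problem and records the iterates. The abstract does not state the stepsize formula, so the rule used here (a Polyak ratio capped by the previous scaled stepsize and divided by c_k = c0 * sqrt(k + 1)) is an assumption, as are the function name `decreasing_polyak_sgd`, the parameters `c0` and `gamma_b`, the lower bound 0 on each minibatch loss, and the toy data.

```python
import random


def decreasing_polyak_sgd(data, x0, steps=200, batch_size=2,
                          c0=1.0, gamma_b=1.0, seed=0):
    """Minibatch SGD on f(x) = mean_i 0.5 * (a_i * x - b_i)**2 for scalar x.

    Assumed stepsize rule (not stated in the text above):
        gamma_k = min(polyak_k, c_{k-1} * gamma_{k-1}) / c_k,
        c_k = c0 * sqrt(k + 1),
    where polyak_k = (f_S(x_k) - lower_bound) / f_S'(x_k)**2 and the lower
    bound on every minibatch loss is taken to be 0.
    """
    rng = random.Random(seed)
    x = x0
    prev_c, prev_gamma = c0, gamma_b  # conventions used for the first step
    iterates = [x]
    for k in range(steps):
        # Draw a minibatch without replacement and evaluate its loss/gradient.
        batch = rng.sample(data, batch_size)
        loss = sum(0.5 * (a * x - b) ** 2 for a, b in batch) / batch_size
        grad = sum(a * (a * x - b) for a, b in batch) / batch_size
        if grad != 0.0:  # skip the update at a stationary point of the batch
            c = c0 * (k + 1) ** 0.5
            polyak = loss / grad ** 2
            gamma = min(polyak, prev_c * prev_gamma) / c
            x -= gamma * grad
            prev_c, prev_gamma = c, gamma
        iterates.append(x)
    return iterates


# Hypothetical toy data with b_i = 2 * a_i, so x = 2 minimizes every batch loss.
toy_data = [(1.0, 2.0), (2.0, 4.0), (3.0, 6.0), (1.5, 3.0)]
iterates = decreasing_polyak_sgd(toy_data, x0=10.0)
largest = max(abs(x) for x in iterates)  # size of the largest iterate seen
```

In this toy problem every minibatch loss shares the minimizer x = 2, and with the default `c0 = 1` each update scales the distance to 2 by a factor between 1/2 and 1, so the iterates cannot move away from the minimizer. This is only a simple case in which boundedness is easy to see directly; it says nothing about the broader function classes treated in the paper.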

Taxonomy

Topics: Stochastic Gradient Optimization Techniques · Stochastic processes and financial applications · Risk and Portfolio Optimization