Instance-optimal stochastic convex optimization: Can we improve upon sample-average and robust stochastic approximation?
Liwei Jiang, Ashwin Pananjady

TL;DR
This paper introduces a variance reduction method called VISOR for stochastic convex optimization, outperforming traditional approaches like sample average approximation and stochastic approximation, and achieving near-optimal sample complexity.
Contribution
The paper proposes VISOR, a novel variance reduction strategy that is instance-optimal and improves sample complexity bounds in stochastic convex optimization.
Findings
VISOR significantly outperforms standard methods with the same sample size.
Finite-sample bounds show VISOR's near-optimal performance.
Application to generalized linear models yields the best-known non-asymptotic bounds.
Abstract
We study the unconstrained minimization of a smooth and strongly convex population loss function under a stochastic oracle that introduces both additive and multiplicative noise; this is a canonical and widely-studied setting that arises across operations research, signal processing, and machine learning. We begin by showing that standard approaches such as sample average approximation and robust (or averaged) stochastic approximation can lead to suboptimal -- and in some cases arbitrarily poor -- performance with realistic finite sample sizes. In contrast, we demonstrate that a carefully designed variance reduction strategy, which we term VISOR for short, can significantly outperform these approaches while using the same sample size. Our upper bounds are complemented by finite-sample, information-theoretic local minimax lower bounds, which highlight fundamental, instance-dependent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Statistical Methods and Inference · Privacy-Preserving Technologies in Data
