The Sample Complexity of Parameter-Free Stochastic Convex Optimization

Jared Lawrence; Ari Kalinsky; Hannah Bradfield; Yair Carmon; Oliver Hinder

arXiv:2506.11336·cs.LG·June 16, 2025

The Sample Complexity of Parameter-Free Stochastic Convex Optimization

Jared Lawrence, Ari Kalinsky, Hannah Bradfield, Yair Carmon, Oliver Hinder

PDF

Open Access

TL;DR

This paper investigates the sample complexity of stochastic convex optimization without known problem parameters, introducing adaptive methods for tuning and regularization that improve performance and avoid overfitting.

Contribution

It proposes a reliable model selection technique and a regularization-based method for parameter-free stochastic convex optimization, enhancing adaptability and theoretical guarantees.

Findings

01

Model selection method matches optimal sample complexity up to log-log factors.

02

Regularization method adapts perfectly to unknown distance to optimality.

03

Experiments show improved performance in few-shot learning and shape counting tasks.

Abstract

We study the sample complexity of stochastic convex optimization when problem parameters, e.g., the distance to optimality, are unknown. We pursue two strategies. First, we develop a reliable model selection method that avoids overfitting the validation set. This method allows us to generically tune the learning rate of stochastic optimization methods to match the optimal known-parameter sample complexity up to $lo g lo g$ factors. Second, we develop a regularization-based method that is specialized to the case that only the distance to optimality is unknown. This method provides perfect adaptability to unknown distance to optimality, demonstrating a separation between the sample and computational complexity of parameter-free stochastic convex optimization. Combining these two methods allows us to simultaneously adapt to multiple problem structures. Experiments performing few-shot…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRisk and Portfolio Optimization · Stochastic Gradient Optimization Techniques · Statistical Methods and Inference

MethodsContrastive Language-Image Pre-training