Empirical Risk Minimization for Stochastic Convex Optimization: $O(1/n)$- and $O(1/n^2)$-type of Risk Bounds
Lijun Zhang, Tianbao Yang, Rong Jin

TL;DR
This paper improves theoretical risk bounds for empirical risk minimization in stochastic convex optimization by exploiting smoothness and strong convexity, achieving near-optimal rates and extending to weaker conditions.
Contribution
It establishes new risk bounds for ERM in SCO, including the first $O(1/n^2)$-type bound, and unifies the analysis under weaker assumptions.
Findings
Achieves $ ilde{O}(d/n + oot{2} F_*)$ risk bound for smooth convex functions.
Proves an $O(rac{ ext{condition number}}{n^2})$ risk bound under strong convexity.
Replaces dimensionality-dependent sample complexity with a dimension-independent bound.
Abstract
Although there exist plentiful theories of empirical risk minimization (ERM) for supervised learning, current theoretical understandings of ERM for a related problem---stochastic convex optimization (SCO), are limited. In this work, we strengthen the realm of ERM for SCO by exploiting smoothness and strong convexity conditions to improve the risk bounds. First, we establish an risk bound when the random function is nonnegative, convex and smooth, and the expected function is Lipschitz continuous, where is the dimensionality of the problem, is the number of samples, and is the minimal risk. Thus, when is small we obtain an risk bound, which is analogous to the optimistic rate of ERM for supervised learning. Second, if the objective function is also -strongly convex, we prove an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Risk and Portfolio Optimization · Advanced Causal Inference Techniques
