Empirical Risk Minimization for Stochastic Convex Optimization:   $O(1/n)$- and $O(1/n^2)$-type of Risk Bounds

Lijun Zhang; Tianbao Yang; Rong Jin

arXiv:1702.02030·cs.LG·February 8, 2017·1 cites

Empirical Risk Minimization for Stochastic Convex Optimization: $O(1/n)$- and $O(1/n^2)$-type of Risk Bounds

Lijun Zhang, Tianbao Yang, Rong Jin

PDF

Open Access

TL;DR

This paper improves theoretical risk bounds for empirical risk minimization in stochastic convex optimization by exploiting smoothness and strong convexity, achieving near-optimal rates and extending to weaker conditions.

Contribution

It establishes new risk bounds for ERM in SCO, including the first $O(1/n^2)$-type bound, and unifies the analysis under weaker assumptions.

Findings

01

Achieves $ ilde{O}(d/n + oot{2} F_*)$ risk bound for smooth convex functions.

02

Proves an $O(rac{ ext{condition number}}{n^2})$ risk bound under strong convexity.

03

Replaces dimensionality-dependent sample complexity with a dimension-independent bound.

Abstract

Although there exist plentiful theories of empirical risk minimization (ERM) for supervised learning, current theoretical understandings of ERM for a related problem---stochastic convex optimization (SCO), are limited. In this work, we strengthen the realm of ERM for SCO by exploiting smoothness and strong convexity conditions to improve the risk bounds. First, we establish an $O (d / n + F_{*} / n)$ risk bound when the random function is nonnegative, convex and smooth, and the expected function is Lipschitz continuous, where $d$ is the dimensionality of the problem, $n$ is the number of samples, and $F_{*}$ is the minimal risk. Thus, when $F_{*}$ is small we obtain an $O (d / n)$ risk bound, which is analogous to the $O (1/ n)$ optimistic rate of ERM for supervised learning. Second, if the objective function is also $λ$ -strongly convex, we prove an…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Inference · Risk and Portfolio Optimization · Advanced Causal Inference Techniques