Optimistic Rates for Learning with a Smooth Loss

Nathan Srebro; Karthik Sridharan; Ambuj Tewari

arXiv:1009.3896·cs.LG·November 27, 2012·45 cites

Optimistic Rates for Learning with a Smooth Loss

Nathan Srebro, Karthik Sridharan, Ambuj Tewari

PDF

Open Access

TL;DR

This paper derives new excess risk bounds for empirical risk minimization using smooth loss functions, showing improved learning rates that depend on the smoothness and complexity of the hypothesis class.

Contribution

It introduces novel excess risk bounds for smooth loss functions in ERM, online, and stochastic convex optimization, with explicit rates depending on smoothness and complexity.

Findings

01

Achieves an O(H R_n^2 + R_n H L*) excess risk bound.

02

For typical classes, obtains an O(RH/n) rate in the separable case.

03

Provides guarantees for online and stochastic convex optimization with smooth objectives.

Abstract

We establish an excess risk bound of O(H R_n^2 + R_n \sqrt{H L*}) for empirical risk minimization with an H-smooth loss function and a hypothesis class with Rademacher complexity R_n, where L* is the best risk achievable by the hypothesis class. For typical hypothesis classes where R_n = \sqrt{R/n}, this translates to a learning rate of O(RH/n) in the separable (L*=0) case and O(RH/n + \sqrt{L^* RH/n}) more generally. We also provide similar guarantees for online and stochastic convex optimization with a smooth non-negative objective.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Sparse and Compressive Sensing Techniques