Optimal learning with $Q$-aggregation

Guillaume Lecué; Philippe Rigollet

arXiv:1301.6080·math.ST·February 28, 2014

TL;DR

This paper shows that the $Q$-aggregation procedure solves the model selection aggregation problem optimally for supervised learning with strongly convex and Lipschitz loss functions, extending earlier results for Gaussian regression.

Contribution

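It generalizes $Q$-aggregation from Gaussian regression with squared loss and fixed design to a general supervised learning setting with strongly convex and Lipschitz loss, and proves optimal oracle inequalities for the resulting estimator.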

Findings

01

The $Q$-aggregation estimator satisfies optimal oracle inequalities in expectation.

02

The estimator also satisfies optimal oracle inequalities with high probability.

03

The results generalize those of Dai, Rigollet and Zhang (2012) for Gaussian regression with squared loss and fixed design.

Abstract

We consider a general supervised learning problem with strongly convex and Lipschitz loss and study the problem of model selection aggregation. In particular, given a finite dictionary of functions (learners) together with a prior, we generalize the results obtained by Dai, Rigollet and Zhang [Ann. Statist. 40 (2012) 1878-1905] for Gaussian regression with squared loss and fixed design to this learning setup. Specifically, we prove that the $Q$-aggregation procedure outputs an estimator that satisfies optimal oracle inequalities both in expectation and with high probability. Our proof techniques depart somewhat from traditional proofs by carrying out most of the standard arguments on the Laplace transform of the empirical process to be controlled.
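The abstract does not spell out the procedure, so the following is only a minimal sketch of what $Q$-aggregation over a finite dictionary might look like. It assumes the $Q$-functional has the form $(1-\nu) R_n(f_\theta) + \nu \sum_j \theta_j R_n(f_j) + (\beta/n) \sum_j \theta_j \log(1/\pi_j)$, minimized over the simplex, where $R_n$ is the empirical risk, $f_\theta = \sum_j \theta_j f_j$, and $\pi$ is the prior. The squared loss, the parameter names `nu` and `beta`, the helper names, and the Frank-Wolfe solver are all illustrative assumptions, not taken from the paper.

```python
import numpy as np


def q_objective(theta, preds, y, prior, nu=0.5, beta=1.0):
    """Assumed Q-functional, with squared loss chosen for concreteness."""
    n = len(y)
    # Empirical risk of the aggregate f_theta = sum_j theta_j * f_j.
    risk_mix = np.mean((y - preds @ theta) ** 2)
    # Empirical risk of each individual learner f_j.
    risks = np.mean((y[:, None] - preds) ** 2, axis=0)
    # Prior-weighted penalty (beta / n) * sum_j theta_j * log(1 / pi_j).
    penalty = (beta / n) * np.sum(theta * np.log(1.0 / prior))
    return (1.0 - nu) * risk_mix + nu * (theta @ risks) + penalty


def q_aggregate(preds, y, prior, nu=0.5, beta=1.0, n_iter=500):
    """Approximately minimize the assumed Q-functional over the simplex.

    Uses Frank-Wolfe steps (an illustrative solver choice) and returns
    the best iterate seen, starting from the prior itself.
    """
    prior = np.asarray(prior, dtype=float)
    n = len(y)
    risks = np.mean((y[:, None] - preds) ** 2, axis=0)
    # Gradient of the two terms that are linear in theta.
    linear_part = nu * risks + (beta / n) * np.log(1.0 / prior)

    theta = prior.copy()
    best = theta.copy()
    best_val = q_objective(theta, preds, y, prior, nu, beta)
    for t in range(n_iter):
        residual = y - preds @ theta
        grad = (1.0 - nu) * (-2.0 / n) * (preds.T @ residual) + linear_part
        j = int(np.argmin(grad))      # best vertex of the simplex
        step = 2.0 / (t + 2.0)
        theta = (1.0 - step) * theta  # move toward vertex e_j
        theta[j] += step
        val = q_objective(theta, preds, y, prior, nu, beta)
        if val < best_val:
            best, best_val = theta.copy(), val
    return best


# Usage on synthetic data: 5 learners, the first one is close to the truth.
rng = np.random.default_rng(0)
n_samples, n_learners = 200, 5
preds = rng.normal(size=(n_samples, n_learners))
y = preds[:, 0] + 0.1 * rng.normal(size=n_samples)
prior = np.full(n_learners, 1.0 / n_learners)

theta_hat = q_aggregate(preds, y, prior)
print(theta_hat)
```

In this sketch, `nu = 0` would reduce the objective to penalized risk minimization over convex combinations, while `nu = 1` makes it linear in `theta`, so the minimizer sits on a single learner. Intermediate values interpolate between the two.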
