Sparse Regression Learning by Aggregation and Langevin Monte-Carlo

Arnak Dalalyan (IGM-LabInfo); Alexandre B. Tsybakov (PMA)

arXiv:0903.1223·stat.AP·June 27, 2012·J. Comput. Syst. Sci.

TL;DR

This paper proves a PAC-Bayesian bound for the exponentially weighted aggregate (EWA) in regression, derives a sparsity oracle inequality for high-dimensional linear models, and proposes Langevin Monte-Carlo algorithms to compute the aggregate efficiently.

Contribution

The paper proves a sharp PAC-Bayesian bound that remains valid for unbounded regression functions, applies it to sparse high-dimensional regression, and proposes Langevin Monte-Carlo methods for computing the aggregate.

Findings

01 Bound holds for unbounded regression functions.

02 EWA achieves sparsity oracle inequality with leading constant one.

03 Langevin Monte-Carlo algorithms effectively approximate the EWA.
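
To make finding 03 concrete, here is a minimal sketch of an unadjusted Langevin iteration whose trajectory average approximates an exponentially weighted aggregate of linear predictors. It is an illustration under stated assumptions, not the authors' algorithm: the Gaussian prior with scale `tau` stands in for the paper's sparsity-favoring prior, and the function name `langevin_ewa`, the step size, and the burn-in length are hypothetical choices.

```python
import math
import random


def langevin_ewa(X, y, beta, tau, step, n_iter, burn_in, seed=0):
    """Average of Langevin iterates targeting the density proportional to exp(-V(lam)).

    V(lam) = ||y - X lam||^2 / beta + ||lam||^2 / (2 tau^2).
    The Gaussian prior term is an assumption made for this sketch.
    """
    rng = random.Random(seed)
    n, M = len(X), len(X[0])
    lam = [0.0] * M
    running_sum = [0.0] * M
    noise_scale = math.sqrt(2.0 * step)
    for k in range(burn_in + n_iter):
        # Residuals y - X lam at the current iterate.
        resid = [y[i] - sum(X[i][j] * lam[j] for j in range(M)) for i in range(n)]
        # Gradient of V with respect to each coordinate of lam.
        grad = [
            -2.0 * sum(X[i][j] * resid[i] for i in range(n)) / beta
            + lam[j] / tau ** 2
            for j in range(M)
        ]
        # Euler discretization of the Langevin diffusion: drift plus Gaussian noise.
        lam = [
            lam[j] - step * grad[j] + noise_scale * rng.gauss(0.0, 1.0)
            for j in range(M)
        ]
        # Accumulate iterates after burn-in to estimate the mean of the target.
        if k >= burn_in:
            for j in range(M):
                running_sum[j] += lam[j]
    return [s / n_iter for s in running_sum]
```

With an identity design, `y = [1.0, -1.0]`, `beta = 1.0` and `tau = 1.0`, the target is Gaussian with mean `2 * y / 3` in each coordinate, so the trajectory average can be checked against that closed form. The discretization is not Metropolis-corrected, so for non-Gaussian targets the average carries a bias that shrinks with the step size.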

Abstract

We consider the problem of regression learning for deterministic design and independent random errors. We start by proving a sharp PAC-Bayesian type bound for the exponentially weighted aggregate (EWA) under the expected squared empirical loss. For a broad class of noise distributions the presented bound is valid whenever the temperature parameter $β$ of the EWA is larger than or equal to $4σ^{2}$, where $σ^{2}$ is the noise variance. A remarkable feature of this result is that it is valid even for unbounded regression functions and the choice of the temperature parameter depends exclusively on the noise level. Next, we apply this general bound to the problem of aggregating the elements of a finite-dimensional linear space spanned by a dictionary of functions $ϕ_{1}, \dots, ϕ_{M}$. We allow $M$ to be much larger than the sample size $n$ but we assume that the true regression…
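
As a rough illustration of the aggregate described in the abstract, the sketch below computes exponential weights over a finite set of candidate predictors, with the temperature set at the threshold $β = 4σ^{2}$. It assumes the usual form of the EWA, in which each candidate's weight is proportional to its prior mass times the exponential of minus its residual sum of squares divided by $β$. The function names `ewa_weights` and `ewa_predict` and the uniform default prior are hypothetical and chosen only for this example.

```python
import math


def ewa_weights(preds, y, beta, prior=None):
    """Exponential weights for K candidates.

    preds: list of K lists, each holding one candidate's values at the n design points.
    y: list of n observed responses.
    beta: temperature parameter (the abstract requires beta >= 4 * sigma^2).
    prior: optional list of K prior masses; uniform if omitted (an assumption).
    """
    K = len(preds)
    if prior is None:
        prior = [1.0 / K] * K
    # Log-weight = log prior - (residual sum of squares) / beta.
    logw = [
        math.log(p) - sum((yi - fi) ** 2 for yi, fi in zip(y, f)) / beta
        for f, p in zip(preds, prior)
    ]
    # Subtract the maximum before exponentiating for numerical stability.
    m = max(logw)
    w = [math.exp(lw - m) for lw in logw]
    total = sum(w)
    return [wi / total for wi in w]


def ewa_predict(preds, weights):
    """Aggregate prediction: weighted average of the candidates at each design point."""
    n = len(preds[0])
    return [sum(w * f[i] for w, f in zip(weights, preds)) for i in range(n)]


sigma2 = 0.25          # assumed known noise variance
beta = 4.0 * sigma2    # smallest temperature allowed by the bound
y = [1.0, 2.0, 3.0]
preds = [[1.0, 2.0, 3.0], [0.0, 0.0, 0.0], [1.0, 2.0, 4.0]]
weights = ewa_weights(preds, y, beta)
aggregate = ewa_predict(preds, weights)
```

In this toy input the first candidate fits the data exactly, so it receives the largest weight, and the constant-zero candidate receives a negligible one. Note that the temperature is set from the noise variance alone, which matches the abstract's remark that the choice of $β$ depends only on the noise level.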
