Sparse Regression Learning by Aggregation and Langevin Monte-Carlo
Arnak Dalalyan (IGM-LabInfo), Alexandre B. Tsybakov (PMA)

TL;DR
This paper develops a PAC-Bayesian bound for exponential weighted aggregation in regression, introduces a sparsity oracle inequality for high-dimensional linear models, and proposes Langevin Monte-Carlo algorithms for efficient computation.
Contribution
It provides a novel PAC-Bayesian bound valid for unbounded functions, applies it to sparse high-dimensional regression, and introduces Langevin Monte-Carlo methods for scalable inference.
Findings
Bound holds for unbounded regression functions.
EWA achieves sparsity oracle inequality with leading constant one.
Langevin Monte-Carlo algorithms effectively approximate the EWA.
Abstract
We consider the problem of regression learning for deterministic design and independent random errors. We start by proving a sharp PAC-Bayesian type bound for the exponentially weighted aggregate (EWA) under the expected squared empirical loss. For a broad class of noise distributions the presented bound is valid whenever the temperature parameter of the EWA is larger than or equal to , where is the noise variance. A remarkable feature of this result is that it is valid even for unbounded regression functions and the choice of the temperature parameter depends exclusively on the noise level. Next, we apply this general bound to the problem of aggregating the elements of a finite-dimensional linear space spanned by a dictionary of functions . We allow to be much larger than the sample size but we assume that the true regression…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
