Pac-bayesian bounds for sparse regression estimation with exponential weights
Pierre Alquier, Karim Lounici

TL;DR
This paper introduces an improved exponential weight estimator for high-dimensional sparse regression, providing probabilistic guarantees and a practical MCMC algorithm for computation.
Contribution
It develops a new aggregation procedure with better statistical guarantees and proposes an MCMC method for efficient computation in large-scale problems.
Findings
Provides a sparsity oracle inequality in probability for the true excess risk.
Proposes an MCMC algorithm for scalable computation.
Achieves a good statistical-computational trade-off in high-dimensional sparse regression.
Abstract
We consider the sparse regression model where the number of parameters is larger than the sample size . The difficulty when considering high-dimensional problems is to propose estimators achieving a good compromise between statistical and computational performances. The BIC estimator for instance performs well from the statistical point of view \cite{BTW07} but can only be computed for values of of at most a few tens. The Lasso estimator is solution of a convex minimization problem, hence computable for large value of . However stringent conditions on the design are required to establish fast rates of convergence for this estimator. Dalalyan and Tsybakov \cite{arnak} propose a method achieving a good compromise between the statistical and computational aspects of the problem. Their estimator can be computed for reasonably large and satisfies nice statistical properties…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Sparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques
