Minimax risks for sparse regressions: Ultra-high-dimensional phenomenons
Nicolas Verzelen (MISTEA)

TL;DR
This paper investigates the fundamental limits of estimation and testing in ultra-high-dimensional sparse linear regression, revealing phase transitions and the impact of unknown noise variance on minimax risks.
Contribution
It derives minimax risk bounds for sparse regression and testing, highlighting the limitations in ultra-high-dimensional regimes and the effect of unknown noise variance.
Findings
Minimax risks blow up when $k\log(p/k)$ exceeds $n$.
Dimension reduction techniques are ineffective in ultra-high-dimensional settings.
Unknown noise variance significantly affects optimal estimation and testing rates.
Abstract
Consider the standard Gaussian linear regression model , where is a response vector and is a design matrix. Numerous work have been devoted to building efficient estimators of when is much larger than . In such a situation, a classical approach amounts to assume that is approximately sparse. This paper studies the minimax risks of estimation and testing over classes of -sparse vectors . These bounds shed light on the limitations due to high-dimensionality. The results encompass the problem of prediction (estimation of ), the inverse problem (estimation of ) and linear testing (testing ). Interestingly, an elbow effect occurs when the number of variables becomes large compared to . Indeed, the minimax risks and hypothesis separation distances blow up in this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
