Adaptive estimation of High-Dimensional Signal-to-Noise Ratios
Nicolas Verzelen, Elisabeth Gassiat

TL;DR
This paper investigates the challenge of estimating signal-to-noise ratios in high-dimensional linear regression, proposing adaptive methods that achieve near-optimal rates even when key parameters like sparsity and design distribution are unknown.
Contribution
It introduces an adaptive estimation procedure for explained variance that attains minimax rates across unknown sparsity levels, highlighting the importance of design distribution knowledge.
Findings
Adaptive estimator achieves near-minimax rates across sparsity levels.
Unknown design distribution limits the regimes where consistent estimation is possible.
Knowledge of the design distribution significantly improves estimation accuracy.
Abstract
We consider the equivalent problems of estimating the residual variance, the proportion of explained variance and the signal strength in a high-dimensional linear regression model with Gaussian random design. Our aim is to understand the impact of not knowing the sparsity of the regression parameter and not knowing the distribution of the design on minimax estimation rates of . Depending on the sparsity of the regression parameter, optimal estimators of either rely on estimating the regression parameter or are based on U-type statistics, and have minimax rates depending on . In the important situation where is unknown, we build an adaptive procedure whose convergence rate simultaneously achieves the minimax risk over all up to a logarithmic loss which we prove to be non avoidable. Finally, the knowledge of the design distribution is shown to play a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
