Confidence Sets Based on Thresholding Estimators in High-Dimensional Gaussian Regression Models
Ulrike Schneider

TL;DR
This paper analyzes confidence intervals based on thresholding estimators in high-dimensional Gaussian regression, revealing their larger size compared to standard intervals and providing finite-sample and asymptotic coverage properties.
Contribution
It offers a detailed finite-sample and asymptotic analysis of confidence intervals based on thresholding estimators in high-dimensional settings, including unknown variance cases.
Findings
Thresholding-based intervals are larger than standard intervals.
Asymptotic coverage properties depend on the divergence of sample size and regressors.
Finite-sample bounds for coverage probabilities are established.
Abstract
We study confidence intervals based on hard-thresholding, soft-thresholding, and adaptive soft-thresholding in a linear regression model where the number of regressors may depend on and diverge with sample size . In addition to the case of known error variance, we define and study versions of the estimators when the error variance is unknown. In the known variance case, we provide an exact analysis of the coverage properties of such intervals in finite samples. We show that these intervals are always larger than the standard interval based on the least-squares estimator. Asymptotically, the intervals based on the thresholding estimators are larger even by an order of magnitude when the estimators are tuned to perform consistent variable selection. For the unknown-variance case, we provide non-trivial lower bounds for the coverage probabilities in finite samples and conduct an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsLinear Regression
