Post-selection point and interval estimation of signal sizes in Gaussian samples
Stephen Reid, Jonathan Taylor, Robert Tibshirani

TL;DR
This paper introduces a novel post-selection inference method for estimating signal sizes in Gaussian samples, leveraging recent advances in Lasso-based inference, and demonstrates its effectiveness and theoretical optimality in sparse settings.
Contribution
It adapts recent post-selection inference techniques to the orthogonal setting for estimating multiple means, providing new point and interval estimates with proven risk bounds.
Findings
Proposed estimators perform well against existing methods.
Established an upper bound on worst-case risk within a constant factor of minimax risk.
Framework applies to various selection procedures like top-K and BH.
Abstract
We tackle the problem of the estimation of a vector of means from a single vector-valued observation . Whereas previous work reduces the size of the estimates for the largest (absolute) sample elements via shrinkage (like James-Stein) or biases estimated via empirical Bayes methodology, we take a novel approach. We adapt recent developments by Lee et al (2013) in post selection inference for the Lasso to the orthogonal setting, where sample elements have different underlying signal sizes. This is exactly the setup encountered when estimating many means. It is shown that other selection procedures, like selecting the largest (absolute) sample elements and the Benjamini-Hochberg procedure, can be cast into their framework, allowing us to leverage their results. Point and interval estimates for signal sizes are proposed. These seem to perform quite well against competitors, both…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
