Post-selection inference for penalized M-estimators via score thinning
Ronan Perry, Snigdha Panigrahi, Daniela Witten

TL;DR
This paper introduces a simple, asymptotically valid method for post-selection inference in penalized M-estimators that avoids bespoke procedures by leveraging noise addition and score variable properties.
Contribution
It proposes a novel approach that uses noise addition to achieve independence between model selection and inference, simplifying post-selection inference for penalized M-estimators.
Findings
Asymptotic independence between model selection and inference established.
Method works with standard software like glmnet and glm in R.
Applicable under weak distributional assumptions with independent observations.
Abstract
We consider inference for M-estimators after model selection using a sparsity-inducing penalty. While existing methods for this task require bespoke inference procedures, we propose a simpler approach, which relies on two insights: (i) adding and subtracting carefully-constructed noise to a Gaussian random variable with unknown mean and known variance leads to two \emph{independent} Gaussian random variables; and (ii) both the selection event resulting from penalized M-estimation, and the event that a standard (non-selective) confidence interval for an M-estimator covers its target, can be characterized in terms of an approximately normal ``score variable". We combine these insights to show that -- when the noise is chosen carefully -- there is asymptotic independence between the model selected using a noisy penalized M-estimator, and the event that a standard (non-selective) confidence…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Advanced Causal Inference Techniques · Gaussian Processes and Bayesian Inference
