Selective inference in regression models with groups of variables
Joshua R. Loftus, Jonathan E. Taylor

TL;DR
This paper develops a mathematical framework for conducting valid statistical inference after model selection in regression models with grouped variables, applicable to various selection criteria and implemented in R.
Contribution
It introduces a general approach for selective inference in models with grouped variables, including exact significance tests and software implementation.
Findings
Provides an exact significance test for each selected group.
Applicable to models chosen by AIC, BIC, or fixed steps.
Efficient R package implementation available.
Abstract
We provide a general mathematical framework for selective inference with supervised model selection procedures characterized by quadratic forms in the outcome variable. Forward stepwise with groups of variables is an important special case as it allows models with categorical variables or factors. Models can be chosen by AIC, BIC, or a fixed number of steps. We provide an exact significance test for each group of variables in the selected model based on an appropriately truncated or distribution for the cases of known and unknown respectively. An efficient software implementation is available as a package in the R statistical programming language.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Advanced Statistical Methods and Models · Statistical Methods and Bayesian Inference
