A general framework for estimation and inference from clusters of   features

Stephen Reid; Jonathan Taylor; Robert Tibshirani

arXiv:1511.07839·stat.AP·November 25, 2015

A general framework for estimation and inference from clusters of features

Stephen Reid, Jonathan Taylor, Robert Tibshirani

PDF

Open Access

TL;DR

This paper introduces a new framework for testing group-wide signals in predictor clusters, using prototype-based models and selective inference to improve power over classical methods.

Contribution

It proposes a novel prototype model and testing procedure that incorporate response information and account for variable selection, enhancing inference in grouped predictor settings.

Findings

01

Proposed tests outperform classical methods in power.

02

Use of response-informed prototypes improves detection.

03

Selective inference ensures valid p-values despite variable selection.

Abstract

Applied statistical problems often come with pre-specified groupings to predictors. It is natural to test for the presence of simultaneous group-wide signal for groups in isolation, or for multiple groups together. Classical tests for the presence of such signals rely either on tests for the omission of the entire block of variables (the classical F-test) or on the creation of an unsupervised prototype for the group (either a group centroid or first principal component) and subsequent t-tests on these prototypes. In this paper, we propose test statistics that aim for power improvements over these classical approaches. In particular, we first create group prototypes, with reference to the response, hopefully improving on the unsupervised prototypes, and then testing with likelihood ratio statistics incorporating only these prototypes. We propose a (potentially) novel model, called the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Inference · Advanced Statistical Methods and Models · Bayesian Methods and Mixture Models