Double-estimation-friendly inference for high-dimensional misspecified models

Rajen D. Shah, Peter Bühlmann

arXiv:1909.10828 · math.ST · May 20, 2022

TL;DR

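This expository paper shows that standard tests, such as the $t$-test in a linear model, keep asymptotic type I error control for the null that $X$ is conditionally independent of $Y$ given $Z$ even when the regression model for $Y$ is misspecified, provided a linear model for $X$ on $Z$ holds. It proposes methodology for high-dimensional regression that respects this property.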

Contribution

The paper names this robustness the "double-estimation-friendly" (DEF) property and proposes methodology for high-dimensional regression settings that respects it.

Findings

01. Valid inference under misspecification in high-dimensional settings

02. Extension of DEF property to generalized linear models

03. Numerical experiments confirm effectiveness

Abstract

All models may be wrong -- but that is not necessarily a problem for inference. Consider the standard $t$-test for the significance of a variable $X$ for predicting response $Y$ whilst controlling for $p$ other covariates $Z$ in a random design linear model. This yields correct asymptotic type I error control for the null hypothesis that $X$ is conditionally independent of $Y$ given $Z$ under an arbitrary regression model of $Y$ on $(X, Z)$, provided that a linear regression model for $X$ on $Z$ holds. An analogous robustness to misspecification, which we term the "double-estimation-friendly" (DEF) property, also holds for Wald tests in generalised linear models, with some small modifications. In this expository paper we explore this phenomenon, and propose methodology for high-dimensional regression settings that respects the DEF property. We advocate specifying (sparse)…
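The low-dimensional claim in the abstract can be checked informally by simulation. The sketch below is not taken from the paper; the data-generating choices (Gaussian $Z$, the coefficient vector `gamma`, a response built from `sin` and a square of the covariates) and the function names are illustrative assumptions. It draws $X$ from a linear model in $Z$, draws $Y$ from a nonlinear model in $Z$ alone so that the null holds, fits the misspecified linear regression of $Y$ on $(X, Z)$, and records how often the usual $t$-test rejects at the 5% level. A normal approximation stands in for the $t$ distribution to keep the example free of extra dependencies.

```python
import math
import numpy as np


def t_test_pvalue(x, y, Z):
    """Two-sided p-value for the coefficient of x in OLS of y on (1, x, Z).

    Uses the usual homoscedastic variance estimate and a normal
    approximation to the t distribution (reasonable for large n).
    """
    n = len(y)
    design = np.column_stack([np.ones(n), x, Z])
    beta, *_ = np.linalg.lstsq(design, y, rcond=None)
    resid = y - design @ beta
    dof = n - design.shape[1]
    sigma2 = resid @ resid / dof
    cov = sigma2 * np.linalg.inv(design.T @ design)
    t_stat = beta[1] / math.sqrt(cov[1, 1])
    # erfc(|t| / sqrt(2)) equals 2 * (1 - Phi(|t|)) for a standard normal
    return math.erfc(abs(t_stat) / math.sqrt(2))


def simulate_rejection_rate(n_reps=1000, n=200, p=3, alpha=0.05, seed=0):
    """Empirical rejection rate of the t-test when the null holds.

    Illustrative setup (an assumption, not from the paper):
    X is linear in Z with independent noise; Y is a nonlinear function
    of Z plus noise, so X and Y are conditionally independent given Z
    while the linear model for Y is misspecified.
    """
    rng = np.random.default_rng(seed)
    gamma = np.ones(p)  # hypothetical coefficients for X on Z
    rejections = 0
    for _ in range(n_reps):
        Z = rng.standard_normal((n, p))
        x = Z @ gamma + rng.standard_normal(n)
        y = np.sin(2 * Z[:, 0]) + Z[:, 1] ** 2 + rng.standard_normal(n)
        if t_test_pvalue(x, y, Z) < alpha:
            rejections += 1
    return rejections / n_reps


if __name__ == "__main__":
    rate = simulate_rejection_rate()
    print(f"Empirical type I error at nominal 5%: {rate:.3f}")
```

If the property described in the abstract holds in this setup, the printed rate should sit near the nominal level despite the misspecified model for $Y$. Replacing the linear model for `x` with a nonlinear one is a quick way to see what happens when neither model is correct.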

Taxonomy

Topics: Statistical Methods and Inference · Statistical Methods and Bayesian Inference · Advanced Causal Inference Techniques