A causal framework for distribution generalization

Rune Christiansen; Niklas Pfister; Martin Emil Jakobsen; Nicola Gnecco; and Jonas Peters

arXiv:2006.07433·stat.ME·August 19, 2021·IEEE Trans. Pattern Anal. Mach. Intell.

A causal framework for distribution generalization

Rune Christiansen, Niklas Pfister, Martin Emil Jakobsen, Nicola Gnecco, and Jonas Peters

PDF

1 Repo

TL;DR

This paper develops a causal framework for distribution generalization, analyzing how to predict responses under distribution shifts caused by interventions, and introduces a practical method for nonlinear models.

Contribution

It formalizes distribution generalization in nonlinear causal models, characterizes when causal models are minimax optimal, and proposes a consistent method called NILE for nonlinear IV settings.

Findings

01

NILE achieves distribution generalization in nonlinear IV models.

02

The framework characterizes conditions for causal models to be minimax optimal.

03

Empirical results demonstrate NILE's effectiveness in practice.

Abstract

We consider the problem of predicting a response $Y$ from a set of covariates $X$ when test and training distributions differ. Since such differences may have causal explanations, we consider test distributions that emerge from interventions in a structural causal model, and focus on minimizing the worst-case risk. Causal regression models, which regress the response on its direct causes, remain unchanged under arbitrary interventions on the covariates, but they are not always optimal in the above sense. For example, for linear models and bounded interventions, alternative solutions have been shown to be minimax prediction optimal. We introduce the formal framework of distribution generalization that allows us to analyze the above problem in partially observed nonlinear models for both direct interventions on $X$ and interventions that occur indirectly via exogenous variables $A$ . It…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

runesen/NILE
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.