Invariant Models for Causal Transfer Learning
Mateo Rojas-Carulla, Bernhard Sch\"olkopf, Richard Turner, Jonas, Peters

TL;DR
This paper introduces invariant models based on causal assumptions for transfer learning, particularly in domain generalization, demonstrating their effectiveness through theoretical analysis and experiments on synthetic and biological data.
Contribution
It proposes a causal-inspired invariant subset approach for transfer learning, with a practical method for automatic subset inference and theoretical guarantees in domain generalization.
Findings
Invariant subset improves domain generalization performance.
The method outperforms data pooling in diverse task settings.
Experimental results validate the approach on synthetic and gene deletion datasets.
Abstract
Methods of transfer learning try to combine knowledge from several related tasks (or domains) to improve performance on a test task. Inspired by causal methodology, we relax the usual covariate shift assumption and assume that it holds true for a subset of predictor variables: the conditional distribution of the target variable given this subset of predictors is invariant over all tasks. We show how this assumption can be motivated from ideas in the field of causality. We focus on the problem of Domain Generalization, in which no examples from the test task are observed. We prove that in an adversarial setting using this subset for prediction is optimal in Domain Generalization; we further provide examples, in which the tasks are sufficiently diverse and the estimator therefore outperforms pooling the data, even on average. If examples from the test task are available, we also provide a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Algorithms · Bayesian Modeling and Causal Inference
