Robust Representation Learning through Explicit Environment Modeling
Yuli Slavutsky, David M. Blei

TL;DR
This paper introduces a method for learning robust representations across multiple environments by explicitly modeling and marginalizing environment variation, outperforming invariant methods in challenging settings.
Contribution
It proposes a generalized random-intercept model that explicitly accounts for environment variation, providing a new approach to robust representation learning.
Findings
Models outperform invariant-learning methods in various challenging scenarios.
Explicit environment modeling improves generalization to unseen environments.
Theoretical analysis characterizes when these models are preferable.
Abstract
We consider learning from labeled data collected across multiple environments, where the data distribution may vary across these environments. This problem is commonly approached from a causal perspective, seeking invariant representations that retain causal factors while discarding spurious ones. However, this framework assumes that the environment has no direct effect on the target. In contrast, we consider settings in which this assumption fails, but still aim to learn representations that support robust prediction on average across previously unseen environments. To this end, we study representations learned by explicitly modeling variation across environments and then marginalizing that variation out. We analyze the resulting representations and characterize when they are preferable to those learned by causal invariant-representation methods. We propose a concrete method based on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
