Intervening to Learn and Compose Causally Disentangled Representations

Alex Markham; Isaac Hirsch; Jeri A. Chang; Liam Solus; Bryon Aragam

arXiv:2507.04754·stat.ML·April 3, 2026

Intervening to Learn and Compose Causally Disentangled Representations

Alex Markham, Isaac Hirsch, Jeri A. Chang, Liam Solus, Bryon Aragam

PDF

TL;DR

This paper introduces a novel training method for generative models that achieves causally disentangled representations by adding a context module, enabling out-of-distribution generation and extending identifiability theory.

Contribution

It proposes a simple yet effective context module that allows arbitrarily expressive models to learn causally disentangled concepts during training.

Findings

01

The approach produces causally disentangled representations capable of out-of-distribution generation.

02

The method can be integrated into end-to-end training or fine-tuning of pre-trained models.

03

A new identifiability theorem extends existing results on structured representations.

Abstract

In designing generative models, it is commonly believed that in order to learn useful latent structure, we face a fundamental tension between expressivity and structure. In this paper we challenge this view by proposing a new approach to training arbitrarily expressive generative models that simultaneously learn causally disentangled concepts. This is accomplished by adding a simple context module to an arbitrarily complex black-box model, which learns to process concept information by implicitly inverting linear representations from the model's encoder. Inspired by the notion of intervention in a causal model, our module selectively modifies its architecture during training, allowing it to learn a compact joint model over different contexts. We show how adding this module leads to causally disentangled representations that can be composed for out-of-distribution generation on both real…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.