CITRIS: Causal Identifiability from Temporal Intervened Sequences
Phillip Lippe, Sara Magliacane, Sindy L\"owe, Yuki M. Asano, Taco, Cohen, Efstratios Gavves

TL;DR
CITRIS is a variational autoencoder framework that identifies causal factors from temporal image sequences with interventions, leveraging temporality and pretrained autoencoders to improve causal representation learning and generalization.
Contribution
It introduces CITRIS, a novel method that exploits temporal data and intervention targets to identify causal factors, extending identifiability results to more complex settings.
Findings
Outperforms previous methods in recovering causal variables from 3D image sequences.
Can leverage pretrained autoencoders to generalize to unseen causal factor instantiations.
Proves identifiability in settings where only some components of causal factors are intervened.
Abstract
Understanding the latent causal factors of a dynamical system from visual observations is considered a crucial step towards agents reasoning in complex environments. In this paper, we propose CITRIS, a variational autoencoder framework that learns causal representations from temporal sequences of images in which underlying causal factors have possibly been intervened upon. In contrast to the recent literature, CITRIS exploits temporality and observing intervention targets to identify scalar and multidimensional causal factors, such as 3D rotation angles. Furthermore, by introducing a normalizing flow, CITRIS can be easily extended to leverage and disentangle representations obtained by already pretrained autoencoders. Extending previous results on scalar causal factors, we prove identifiability in a more general setting, in which only some components of a causal factor are affected by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Advanced Vision and Imaging
