CITRIS: Causal Identifiability from Temporal Intervened Sequences

Phillip Lippe; Sara Magliacane; Sindy Löwe; Yuki M. Asano; Taco Cohen; Efstratios Gavves

arXiv:2202.03169 · cs.LG · June 16, 2022 · 6 cites

TL;DR

CITRIS is a variational autoencoder framework that identifies causal factors from temporal image sequences with interventions, leveraging temporality and pretrained autoencoders to improve causal representation learning and generalization.

Contribution

The paper introduces CITRIS, a method that exploits temporal data and observed intervention targets to identify causal factors, and extends earlier identifiability results for scalar causal factors to multidimensional ones.

Findings

01. Outperforms previous methods in recovering causal variables from 3D image sequences.

02. Can leverage pretrained autoencoders to generalize to unseen causal factor instantiations.

03. Proves identifiability in settings where only some components of a causal factor are affected by an intervention.

Abstract

Understanding the latent causal factors of a dynamical system from visual observations is considered a crucial step towards agents reasoning in complex environments. In this paper, we propose CITRIS, a variational autoencoder framework that learns causal representations from temporal sequences of images in which underlying causal factors have possibly been intervened upon. In contrast to the recent literature, CITRIS exploits temporality and observing intervention targets to identify scalar and multidimensional causal factors, such as 3D rotation angles. Furthermore, by introducing a normalizing flow, CITRIS can be easily extended to leverage and disentangle representations obtained by already pretrained autoencoders. Extending previous results on scalar causal factors, we prove identifiability in a more general setting, in which only some components of a causal factor are affected by…
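
As a rough illustration of how temporality and observed intervention targets could enter such a model, the sketch below evaluates a toy Gaussian transition prior over latent variables. It is not the authors' implementation: the function name `transition_log_prob`, the linear dynamics matrix `weight`, the fixed `assignment` of latent dimensions to causal factors, and the Gaussian form itself are all assumptions made for this example.

```python
import numpy as np

def transition_log_prob(z_next, z_prev, targets, assignment, weight,
                        interv_mean, sigma=1.0):
    """Log-density of a toy Gaussian prior p(z_next | z_prev, targets).

    assignment[d] is the index of the (hypothetical) causal factor that
    latent dimension d belongs to; targets[k] is 1 if factor k was
    intervened upon at this time step, else 0.
    """
    z_next = np.asarray(z_next, dtype=float)
    z_prev = np.asarray(z_prev, dtype=float)
    # Per-dimension flag: was the factor owning this dimension intervened on?
    intervened = np.asarray(targets)[np.asarray(assignment)].astype(bool)
    # Without an intervention the mean depends on the previous latents
    # (temporality); under an intervention it ignores them.
    mean = np.where(intervened, interv_mean, weight @ z_prev)
    resid = (z_next - mean) / sigma
    log_norm = z_next.size * np.log(sigma * np.sqrt(2.0 * np.pi))
    return float(-0.5 * np.sum(resid ** 2) - log_norm)

# Assumed toy setup: factor 0 spans two latent dimensions, factor 1 spans one.
assignment = [0, 0, 1]
weight = np.eye(3)                  # identity dynamics, an assumption
z_prev = np.array([1.0, 2.0, 3.0])
z_next = np.array([1.0, 2.0, 0.0])  # the last dimension was reset
interv_mean = np.zeros(3)

lp_intervened = transition_log_prob(z_next, z_prev, [0, 1], assignment,
                                    weight, interv_mean)
lp_observational = transition_log_prob(z_next, z_prev, [0, 0], assignment,
                                       weight, interv_mean)
print(lp_intervened > lp_observational)
```

In this toy setup the transition is better explained when factor 1 is marked as intervened upon, which hints at why knowing the intervention targets helps attribute latent dimensions to causal factors. Factor 0 owning two dimensions loosely mirrors the multidimensional causal factors mentioned in the abstract.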

Taxonomy

Topics: Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Advanced Vision and Imaging