Multi-View Causal Representation Learning with Partial Observability
Dingling Yao, Danru Xu, S\'ebastien Lachapelle, Sara Magliacane,, Perouz Taslakian, Georg Martius, Julius von K\"ugelgen, Francesco, Locatello

TL;DR
This paper introduces a theoretical framework for learning identifiable causal representations from multiple partially observed views using contrastive learning, extending previous work on multi-view nonlinear ICA and disentanglement.
Contribution
It provides a unified theoretical approach for identifying shared causal latent variables from partial views, including graphical criteria and a contrastive learning method.
Findings
Shared information across views can be learned up to a smooth bijection.
Graphical criteria determine which latent variables are identifiable.
Prior methods are recovered as special cases of this framework.
Abstract
We present a unified framework for studying the identifiability of representations learned from simultaneously observed views, such as different data modalities. We allow a partially observed setting in which each view constitutes a nonlinear mixture of a subset of underlying latent variables, which can be causally related. We prove that the information shared across all subsets of any number of views can be learned up to a smooth bijection using contrastive learning and a single encoder per view. We also provide graphical criteria indicating which latent variables can be identified through a simple set of rules, which we refer to as identifiability algebra. Our general framework and theoretical results unify and extend several previous works on multi-view nonlinear ICA, disentanglement, and causal representation learning. We experimentally validate our claims on numerical, image, and…
Peer Reviews
Decision·ICLR 2024 spotlight
This paper is written extremely well, and reads like a chapter from a textbook. The authors frequently refer to a running example and include “intuition” paragraphs to make it easier to understand their definitions and results. The loss that leads to identifiability (Eq. 3.1) is simple and intuitive, which makes me optimistic about the (eventual) practical applicability of this approach. The authors’ framework unifies many existing theoretical results in nonlinear ICA and causal representation l
The main weakness of this work is its impracticality and lack of experimental results on realistic datasets. However, this is also acknowledged by the authors, and can also be said regarding most papers in this research area.
- Causal representation learning has gotten a lot of attention recently, due to the promises it holds. This work is another important step in this direction. - The general identifiability result captures a variety of prior works on CRL, therefore it serves as a neat unifying contribution. - The approach to avoid using exponential set of encoders for each subset of views is very neat and is crucial for this work, for efficiency purposes.
- R^2 metric is reported for validating identifiability, however as the authors also note, there are instances when R^2 scores can be inflated. Why didn't the authors also cite other standard identifiability metrics like MCC (such as the ones reported in [1], [2])?
• The paper is well-written and communicates clearly motivations, formulation, technical details, and theoretical implications. • This paper extends previous content-identification results (mainly von Kugelgen et al. 2021, Daunhawer et al. 2023) to multi-view distributions and further (identifiability algebra). • The empirical evaluation is thorough (over multi-view/modality/task datasets) and well-designed to substantiate the theoretical findings in the paper.
• My primary concern is the novelty of the work. In comparison with prior work (von Kugelgen et al. 2021; Daunhawer et al. 2023), the technical contribution appears somewhat incremental. • The assumption that each view-specific generating function is invertible is strong, especially over multiple modalities. Intuitively, this requires the shared blocks to be duplicated multiple times and thus simplifies the identification problem. A recent work [1] has attempted to weaken this assumption. [1]
Code & Models
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Explainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning
MethodsSparse Evolutionary Training · Independent Component Analysis · Contrastive Learning
