Sanity Checking Causal Representation Learning on a Simple Real-World   System

Juan L. Gamella; Simon Bing; Jakob Runge

arXiv:2502.20099·cs.LG·April 29, 2025

Sanity Checking Causal Representation Learning on a Simple Real-World System

Juan L. Gamella, Simon Bing, Jakob Runge

PDF

Open Access 2 Repos

TL;DR

This paper evaluates causal representation learning methods on a controlled real-world optical system with known causal factors, revealing significant failures and reproducibility issues that challenge their practical applicability.

Contribution

It provides a benchmark and analysis highlighting the limitations of current CRL methods in real-world settings, emphasizing the need for more robust approaches.

Findings

01

All evaluated methods failed to recover causal factors.

02

Most methods failed even on simplified synthetic data.

03

Assumptions on mixing functions are crucial but often violated.

Abstract

We evaluate methods for causal representation learning (CRL) on a simple, real-world system where these methods are expected to work. The system consists of a controlled optical experiment specifically built for this purpose, which satisfies the core assumptions of CRL and where the underlying causal factors (the inputs to the experiment) are known, providing a ground truth. We select methods representative of different approaches to CRL and find that they all fail to recover the underlying causal factors. To understand the failure modes of the evaluated algorithms, we perform an ablation on the data by substituting the real data-generating process with a simpler synthetic equivalent. The results reveal a reproducibility problem, as most methods already fail on this synthetic ablation despite its simple data-generating process. Additionally, we observe that common assumptions on the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Modeling and Causal Inference · Explainable Artificial Intelligence (XAI) · Generative Adversarial Networks and Image Synthesis