Towards Unsupervised Causal Representation Learning via Latent Additive Noise Model Causal Autoencoders
Hans Jarett J. Ong, Brian Godwin S. Lim, Dominic Dayta, Renzo Roel P. Tan, Kazushi Ikeda

TL;DR
This paper introduces LANCA, a novel autoencoder-based method that uses the Additive Noise Model as an inductive bias to improve unsupervised causal representation learning, addressing identifiability issues.
Contribution
LANCA operationalizes the ANM as a strong inductive bias within a deterministic WAE framework, enhancing causal disentanglement without supervision.
Findings
LANCA outperforms state-of-the-art methods on synthetic physics benchmarks.
LANCA demonstrates robustness to spurious correlations in complex environments.
Theoretical analysis shows ANM constrains transformations to affine class.
Abstract
Unsupervised representation learning seeks to recover latent generative factors, yet standard methods relying on statistical independence often fail to capture causal dependencies. A central challenge is identifiability: as established in disentangled representation learning and nonlinear ICA literature, disentangling causal variables from observational data is impossible without supervision, auxiliary signals, or strong inductive biases. In this work, we propose the Latent Additive Noise Model Causal Autoencoder (LANCA) to operationalize the Additive Noise Model (ANM) as a strong inductive bias for unsupervised discovery. Theoretically, we prove that while the ANM constraint does not guarantee unique identifiability in the general mixing case, it resolves component-wise indeterminacy by restricting the admissible transformations from arbitrary diffeomorphisms to the affine class.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Bayesian Modeling and Causal Inference · Explainable Artificial Intelligence (XAI)
