# Representation Disentaglement via Regularization by Causal   Identification

**Authors:** Juan Castorena

arXiv: 2303.00128 · 2024-01-29

## TL;DR

This paper introduces a causal collider-based regularization method called ReI to improve disentangled representation learning, especially in biased datasets, by enforcing causal identification constraints.

## Contribution

It extends traditional models with causal collider structures and proposes ReI, a modular regularization technique that enhances disentanglement and interpretability in generative models.

## Key findings

- ReI outperforms existing methods on standard benchmarks.
- ReI produces interpretable, robust representations in real-world data.
- The approach effectively handles biased datasets with sampling selection bias.

## Abstract

In this work, we propose the use of a causal collider structured model to describe the underlying data generative process assumptions in disentangled representation learning. This extends the conventional i.i.d. factorization assumption model $p(\mathbf{y}) = \prod_{i} p(\mathbf{y}_i )$, inadequate to handle learning from biased datasets (e.g., with sampling selection bias). The collider structure, explains that conditional dependencies between the underlying generating variables may be exist, even when these are in reality unrelated, complicating disentanglement. Under the rubric of causal inference, we show this issue can be reconciled under the condition of causal identification; attainable from data and a combination of constraints, aimed at controlling the dependencies characteristic of the \textit{collider} model. For this, we propose regularization by identification (ReI), a modular regularization engine designed to align the behavior of large scale generative models with the disentanglement constraints imposed by causal identification. Empirical evidence on standard benchmarks demonstrates the superiority of ReI in learning disentangled representations in a variational framework. In a real-world dataset we additionally show that our framework, results in interpretable representations robust to out-of-distribution examples and that align with the true expected effect from domain knowledge.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/2303.00128/full.md

## Figures

27 figures with captions in the complete paper: https://tomesphere.com/paper/2303.00128/full.md

## References

60 references — full list in the complete paper: https://tomesphere.com/paper/2303.00128/full.md

---
Source: https://tomesphere.com/paper/2303.00128