A Scale-Invariant Sorting Criterion to Find a Causal Order in Additive Noise Models
Alexander G. Reisach, Myriam Tami, Christof Seiler, Antoine Chambaz,, Sebastian Weichwald

TL;DR
This paper introduces a scale-invariant pattern called high R^2-sortability in additive noise models, enabling efficient causal order discovery by sorting variables based on their explained variance, and surpassing existing algorithms in performance.
Contribution
The paper proposes the R^2-SortnRegress algorithm leveraging high R^2-sortability for causal discovery, providing a scale-invariant alternative to variance-based methods, and offers analytical insights into its effectiveness.
Findings
High R^2 increases along causal chains in ANMs.
R^2-SortnRegress matches or exceeds established algorithms.
High R^2-sortability is common in typical ANM settings.
Abstract
Additive Noise Models (ANMs) are a common model class for causal discovery from observational data and are often used to generate synthetic data for causal discovery benchmarking. Specifying an ANM requires choosing all parameters, including those not fixed by explicit assumptions. Reisach et al. (2021) show that sorting variables by increasing variance often yields an ordering close to a causal order and introduce var-sortability to quantify this alignment. Since increasing variances may be unrealistic and are scale-dependent, ANM data are often standardized in benchmarks. We show that synthetic ANM data are characterized by another pattern that is scale-invariant: the explainable fraction of a variable's variance, as captured by the coefficient of determination , tends to increase along the causal order. The result is high -sortability, meaning that sorting the variables…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsData Quality and Management · Bayesian Modeling and Causal Inference · Topic Modeling
