Deriving Causal Order from Single-Variable Interventions: Guarantees & Algorithm
Mathieu Chevalley, Patrick Schwab, Arash Mehrjou

TL;DR
This paper introduces a new method for inferring causal order from datasets with many single-variable interventions, providing theoretical guarantees and demonstrating superior empirical performance over existing methods.
Contribution
It proposes a novel interventional faithfulness assumption and an algorithm, Intersort, that effectively extracts causal order from large interventional datasets under realistic conditions.
Findings
Intersort outperforms baseline methods on simulated benchmark data.
Theoretical guarantees hold for large-scale datasets.
The new assumption enables effective causal inference from single-variable interventions.
Abstract
Targeted and uniform interventions to a system are crucial for unveiling causal relationships. While several methods have been developed to leverage interventional data for causal structure learning, their practical application in real-world scenarios often remains challenging. Recent benchmark studies have highlighted these difficulties, even when large numbers of single-variable intervention samples are available. In this work, we demonstrate, both theoretically and empirically, that such datasets contain a wealth of causal information that can be effectively extracted under realistic assumptions about the data distribution. More specifically, we introduce a novel variant of interventional faithfulness, which relies on comparisons between the marginal distributions of each variable across observational and interventional settings, and we introduce a score on causal orders. Under this…
Peer Reviews
Decision·ICLR 2025 Poster
Originality/Quality: The authors asserted at the end of the Related Work section that they are proposing the first algorithm to infer the causal order from the interventional data. I do not think this is true. First, (Tian and Pearl, 2013) is one of the earliest works using the changes in marginal distributions due to intervention in inferring some orders among the variables in the system (See Section 4 there). Second, there are extensive works on recovering an equivalence class of models from t
Comparison with previous work: I think there is no clear comparison with previous work (especially with (Tian and Pearl, 2013)) and discussion about the advantages of the current approach. Theoretical result in a very limited setting: I think the assumption of having single-variable intervention on all the variables is very restrictive (what can we say in theory about the recovered causal order if a portion of variables are intervened on?). Moreover, the proposed algorithm is designed under th
* The paper introduces a new definition of faithfulness with both theoretical guarantees and empirical results. * It is well-written and clear. * Numerical experiments are designed in a sensible manner that adequately supports the claims. * This is an important and highly relevant contribution to the community, with developed theory that has the potential to further advance causality.
* It’s unclear how the findings would generalize to different settings, such as different distributions over random interventional variables, or in the case of having discrete causal variables. * Empirical experiments are conducted on a limited set of underlying models.
This paper presents an original solution to a significant problem: a method for directly learning the causal order using interventional data. * Although technically dense, the paper is surprisingly easy to read and well-organized. * The theoretical guarantees for optimality under a reasonable faithfulness assumption appear solid. * The approximation algorithm is computationally tractable and validated with a variety of benchmarks.
* The majority of the analysis assumes access to oracle statistical distances between interventional distributions. Little attention is paid to the estimation of these distances using samples, and how they affect the sorting algorithm. * The main score objective (Equation 1) is difficult to understand and not explained much. Minor comments * Please define all the terms on like 152, like the noise variable $N_j$.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference
