Constraint-based Causal Discovery from Multiple Interventions over Overlapping Variable Sets
Sofia Triantafillou, Ioannis Tsamardinos

TL;DR
The paper introduces COmbINE, an algorithm that integrates multiple heterogeneous intervention data sets to infer invariant and variant causal structures, handling conflicting constraints and outperforming previous methods in efficiency.
Contribution
COmbINE is a novel algorithm that combines data from overlapping variable sets under different interventions to infer comprehensive causal models, accounting for conflicts and statistical errors.
Findings
COmbINE outperforms existing algorithms in efficiency.
It effectively handles conflicting constraints from statistical errors.
Demonstrated on real mass-cytometry data sets.
Abstract
Scientific practice typically involves repeatedly studying a system, each time trying to unravel a different perspective. In each study, the scientist may take measurements under different experimental conditions (interventions, manipulations, perturbations) and measure different sets of quantities (variables). The result is a collection of heterogeneous data sets coming from different data distributions. In this work, we present algorithm COmbINE, which accepts a collection of data sets over overlapping variable sets under different experimental conditions; COmbINE then outputs a summary of all causal models indicating the invariant and variant structural characteristics of all models that simultaneously fit all of the input data sets. COmbINE converts estimated dependencies and independencies in the data into path constraints on the data-generating causal model and encodes them as a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference · Gene Regulatory Network Analysis · Gene expression and cancer classification
