
TL;DR
This paper develops an information-theoretic framework for asymptotic causal inference, analyzing the structure and entropy of large causal models, and proposes optimized experimental designs for large datasets.
Contribution
It introduces a novel asymptotic framework using structural and semantic entropy, generalizes bipartite causal designs to k-partite architectures, and analyzes phase transitions in DAG complexity.
Findings
Almost all models are two-layer DAGs for low edge density.
Phase transitions lead to deeper DAG architectures at higher densities.
Provides bounds on sample complexity for causal influence testing.
Abstract
We investigate causal inference in the asymptotic regime as the number of variables approaches infinity using an information-theoretic framework. We define structural entropy of a causal model in terms of its description complexity measured by the logarithmic growth rate, measured in bits, of all directed acyclic graphs (DAGs), parameterized by the edge density d. Structural entropy yields non-intuitive predictions. If we randomly sample a DAG from the space of all models, in the range d = (0, 1/8), almost surely the model is a two-layer DAG! Semantic entropy quantifies the reduction in entropy where edges are removed by causal intervention. Semantic causal entropy is defined as the f-divergence between the observational distribution and the interventional distribution P', where a subset S of edges are intervened on to determine their causal influence. We compare the decomposability…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPhilosophy and History of Science
MethodsTest
