Telling Cause from Effect using MDL-based Local and Global Regression
Alexander Marx, Jilles Vreeken

TL;DR
This paper introduces an MDL-based method for causal inference between two variables by comparing description lengths of their functional relations, and presents an efficient algorithm that outperforms existing methods.
Contribution
It proposes a novel MDL-based regression approach for causal inference and introduces Slope, a linear-time algorithm that significantly improves accuracy over prior methods.
Findings
Slope outperforms state-of-the-art methods on synthetic data.
The MDL-based approach effectively infers causal direction.
The method is efficient and applicable to real-world data.
Abstract
We consider the fundamental problem of inferring the causal direction between two univariate numeric random variables and from observational data. The two-variable case is especially difficult to solve since it is not possible to use standard conditional independence tests between the variables. To tackle this problem, we follow an information theoretic approach based on Kolmogorov complexity and use the Minimum Description Length (MDL) principle to provide a practical solution. In particular, we propose a compression scheme to encode local and global functional relations using MDL-based regression. We infer causes in case it is shorter to describe as a function of than the inverse direction. In addition, we introduce Slope, an efficient linear-time algorithm that through thorough empirical evaluation on both synthetic and real world data we show outperforms…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
