Fast Causal Discovery by Approximate Kernel-based Generalized Score Functions with Linear Computational Complexity
Yixin Ren, Haocheng Zhang, Yewei Xia, Hao Zhang, Jihong Guan, Shuigeng Zhou

TL;DR
This paper introduces a fast, scalable causal discovery method using an approximate kernel-based score function with linear computational complexity, enabling efficient analysis of large datasets without sacrificing accuracy.
Contribution
It proposes a novel low-rank approximation technique and sampling algorithms to reduce the computational complexity of kernel-based causal discovery from cubic to linear time and space.
Findings
Significantly reduces computational costs for causal discovery.
Achieves comparable accuracy to state-of-the-art methods on large datasets.
Effective on both synthetic and real-world data.
Abstract
Score-based causal discovery methods can effectively identify causal relationships by evaluating candidate graphs and selecting the one with the highest score. One popular class of scores is kernel-based generalized score functions, which can adapt to a wide range of scenarios and work well in practice because they circumvent assumptions about causal mechanisms and data distributions. Despite these advantages, kernel-based generalized score functions pose serious computational challenges in time and space, with a time complexity of and a memory complexity of , where is the sample size. In this paper, we propose an approximate kernel-based generalized score function with time and space complexities by using low-rank technique and designing a set of rules to handle the complex composite matrix operations required to calculate the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference · Rough Sets and Fuzzy Logic · Imbalanced Data Classification Techniques
MethodsSparse Evolutionary Training
