Comprehensive Review and Empirical Evaluation of Causal Discovery Algorithms for Numerical Data
Wenjin Niu, Zijun Gao, Liyan Song, Lingbo Li

TL;DR
This paper provides a comprehensive review and empirical evaluation of causal discovery algorithms for numerical data, offering a structured taxonomy, performance insights, and practical guidelines based on dataset characteristics.
Contribution
It introduces a detailed taxonomy of causal discovery methods and conducts extensive empirical testing on diverse datasets, addressing previous fragmentation and evaluation gaps.
Findings
Dataset characteristics significantly influence algorithm performance.
Top-3 recommended algorithms vary by data scenario.
Metadata extraction achieves over 80% accuracy for method selection.
Abstract
Causal analysis has become an essential component in understanding the underlying causes of phenomena across various fields. Despite its significance, existing literature on causal discovery algorithms is fragmented, with inconsistent methodologies, i.e., there is no universal classification standard for existing methods, and a lack of comprehensive evaluations, i.e., data characteristics are often ignored to be jointly analyzed when benchmarking algorithms. This study addresses these gaps by conducting an exhaustive review and empirical evaluation for causal discovery methods on numerical data, aiming to provide a clearer and more structured understanding of the field. Our research begins with a comprehensive literature review spanning over two decades, analyzing over 200 academic articles and identifying more than 40 representative algorithms. This extensive analysis leads to the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFault Detection and Control Systems · Machine Learning and Data Classification
