TL;DR
CausalCompass is a benchmark framework for evaluating the robustness of time-series causal discovery methods under assumption violations, revealing deep learning approaches often perform best across diverse scenarios.
Contribution
We introduce CausalCompass, a flexible benchmark for systematically assessing TSCD methods' robustness under assumption violations, including extensive experiments and analysis.
Findings
Deep learning-based methods outperform others across various assumption-violation scenarios.
NTS-NOTEARS performs poorly without preprocessing but excels after standardization.
No single method is best in all tested scenarios.
Abstract
Causal discovery from time series is a fundamental task in machine learning. However, its widespread adoption is hindered by a reliance on untestable causal assumptions and by the lack of robustness-oriented evaluation in existing benchmarks. To address these challenges, we propose CausalCompass, a flexible and extensible benchmark framework designed to assess the robustness of time-series causal discovery (TSCD) methods under violations of modeling assumptions. To demonstrate the practical utility of CausalCompass, we conduct extensive benchmarking of representative TSCD algorithms across eight assumption-violation scenarios. Our experimental results indicate that no single method consistently attains optimal performance across all settings. Nevertheless, the methods exhibiting superior overall performance across diverse scenarios are almost invariably deep learning-based approaches.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
