CausalTime: Realistically Generated Time-series for Benchmarking of Causal Discovery
Yuxiao Cheng, Ziqian Wang, Tingxiong Xiao, Qin Zhong, Jinli Suo,, Kunlun He

TL;DR
CausalTime is a novel pipeline that generates realistic synthetic time-series data with known causal structures, enabling accurate benchmarking of causal discovery algorithms in real-world scenarios.
Contribution
This work introduces a comprehensive method combining neural networks and prior knowledge to produce realistic, ground-truth labeled time-series for evaluating causal discovery methods.
Findings
Generated data closely matches real data in experiments.
Benchmarking reveals strengths and weaknesses of existing TSCD algorithms.
The approach is applicable across various fields and scenarios.
Abstract
Time-series causal discovery (TSCD) is a fundamental problem of machine learning. However, existing synthetic datasets cannot properly evaluate or predict the algorithms' performance on real data. This study introduces the CausalTime pipeline to generate time-series that highly resemble the real data and with ground truth causal graphs for quantitative performance evaluation. The pipeline starts from real observations in a specific scenario and produces a matching benchmark dataset. Firstly, we harness deep neural networks along with normalizing flow to accurately capture realistic dynamics. Secondly, we extract hypothesized causal graphs by performing importance analysis on the neural network or leveraging prior knowledge. Thirdly, we derive the ground truth causal graphs by splitting the causal model into causal term, residual term, and noise term. Lastly, using the fitted network and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference · Time Series Analysis and Forecasting · Data Quality and Management
