Causal Learning in Biomedical Applications: Krebs Cycle as a Benchmark
Xiaoyu He, Petr Ry\v{s}av\'y, Jakub Mare\v{c}ek

TL;DR
This paper introduces a new, realistic benchmark dataset based on the Krebs cycle for evaluating causal discovery methods on time series data, addressing limitations of synthetic datasets.
Contribution
It provides a reproducible, interpretable, and complex benchmark dataset with ground-truth causal graphs, supporting diverse evaluation metrics and multiple causal discovery methods.
Findings
14 causal discovery methods evaluated
Performance varies across different scenarios
Benchmark facilitates fair comparison and development
Abstract
Learning causal relationships from time series data is an important but challenging problem. Existing synthetic datasets often contain hidden artifacts that can be exploited by causal discovery methods, reducing their usefulness for benchmarking. We present a new benchmark dataset based on simulations of the Krebs cycle, a key biochemical pathway. The data are generated using a particle-based simulator that models molecular interactions in a controlled environment. Four distinct scenarios are provided, varying in time series length, number of samples, and intervention settings. The benchmark includes ground-truth causal graphs for evaluation. It supports quantitative comparisons using metrics such as Structural Hamming Distance, Structural Intervention Distance, and F1-score. A comprehensive evaluation of 14 causal discovery methods from different modelling paradigms is presented.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFault Detection and Control Systems
MethodsSparse Evolutionary Training
