Causal discovery on vector-valued variables and consistency-guided aggregation

Urmi Ninad; Jonas Wahl; Andreas Gerhardus; Jakob Runge

arXiv:2505.10476·stat.ME·May 16, 2025

Causal discovery on vector-valued variables and consistency-guided aggregation

Urmi Ninad, Jonas Wahl, Andreas Gerhardus, Jakob Runge

PDF

Open Access

TL;DR

This paper investigates causal discovery with vector-valued variables, emphasizing the importance of aggregation consistency and proposing a method to optimize aggregation scores for more reliable causal inference.

Contribution

It introduces aggregation consistency scores and a wrapper method, Adag, to improve causal discovery on vector-valued data by optimizing aggregation reliability.

Findings

01

Aggregation scores quantify aggregation soundness.

02

Adag improves causal discovery accuracy.

03

Experimental validation on synthetic data supports the approach.

Abstract

Causal discovery (CD) aims to discover the causal graph underlying the data generation mechanism of observed variables. In many real-world applications, the observed variables are vector-valued, such as in climate science where variables are defined over a spatial grid and the task is called spatio-temporal causal discovery. We motivate CD in vector-valued variable setting while considering different possibilities for the underlying model, and highlight the pitfalls of commonly-used approaches when compared to a fully vectorized approach. Furthermore, often the vector-valued variables are high-dimensional, and aggregations of the variables, such as averages, are considered in interest of efficiency and robustness. In the absence of interventional data, testing for the soundness of aggregate variables as consistent abstractions that map a low-level to a high-level structural causal model…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsData Mining Algorithms and Applications · Bayesian Modeling and Causal Inference