Transforming Causality: Transformer-Based Temporal Causal Discovery with Prior Knowledge Integration

Jihua Huang; Yi Yao; Ajay Divakaran

arXiv:2508.15928·cs.LG·August 25, 2025

Transforming Causality: Transformer-Based Temporal Causal Discovery with Prior Knowledge Integration

Jihua Huang, Yi Yao, Ajay Divakaran

PDF

TL;DR

This paper presents a Transformer-based framework for temporal causal discovery that effectively captures nonlinear relationships, incorporates prior knowledge to reduce spurious links, and outperforms existing methods in accuracy.

Contribution

It introduces a novel Transformer-based approach with prior knowledge integration for improved causal discovery and inference in time-series data.

Findings

01

12.8% improvement in F1-score for causal discovery

02

98.9% accuracy in estimating causal lags

03

Effective mitigation of spurious causal relationships

Abstract

We introduce a novel framework for temporal causal discovery and inference that addresses two key challenges: complex nonlinear dependencies and spurious correlations. Our approach employs a multi-layer Transformer-based time-series forecaster to capture long-range, nonlinear temporal relationships among variables. After training, we extract the underlying causal structure and associated time lags from the forecaster using gradient-based analysis, enabling the construction of a causal graph. To mitigate the impact of spurious causal relationships, we introduce a prior knowledge integration mechanism based on attention masking, which consistently enforces user-excluded causal links across multiple Transformer layers. Extensive experiments show that our method significantly outperforms other state-of-the-art approaches, achieving a 12.8% improvement in F1-score for causal discovery and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.