CaTFormer: Causal Temporal Transformer with Dynamic Contextual Fusion for Driving Intention Prediction
Sirui Wang, Zhou Guan, Bingxi Zhao, Tongjia Gu, Jie Liu

TL;DR
CaTFormer is a novel causal temporal transformer model that improves driving intention prediction by explicitly modeling causal interactions and eliminating spurious correlations, leading to state-of-the-art results and better interpretability.
Contribution
It introduces a new framework with reciprocal delayed fusion, counterfactual residual encoding, and feature synthesis network for robust causal modeling in driving intention prediction.
Findings
Achieves state-of-the-art performance on Brain4Cars dataset.
Effectively captures complex causal temporal dependencies.
Enhances accuracy and transparency of predictions.
Abstract
Accurate prediction of driving intention is key to enhancing the safety and interactive efficiency of human-machine co-driving systems. It serves as a cornerstone for achieving high-level autonomous driving. However, current approaches remain inadequate for accurately modeling the complex spatiotemporal interdependencies and the unpredictable variability of human driving behavior. To address these challenges, we propose CaTFormer, a causal Temporal Transformer that explicitly models causal interactions between driver behavior and environmental context for robust intention prediction. Specifically, CaTFormer introduces a novel Reciprocal Delayed Fusion (RDF) mechanism for precise temporal alignment of interior and exterior feature streams, a Counterfactual Residual Encoding (CRE) module that systematically eliminates spurious correlations to reveal authentic causal dependencies, and an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAutonomous Vehicle Technology and Safety · Human-Automation Interaction and Safety · EEG and Brain-Computer Interfaces
