Can Attribution Predict Risk? From Multi-View Attribution to Planning Risk Signals in End-to-End Autonomous Driving
Le Yang, Ruoyu Chen, Haijun Liu, Jiawei Liang, ShangQuan Sun, Xiaochun Cao

TL;DR
This paper introduces a hierarchical attribution framework for end-to-end autonomous driving models, linking attribution maps to planning risk signals and demonstrating their effectiveness in risk prediction and generalization.
Contribution
It develops a coarse-to-fine region attribution method and extracts novel attribution statistics that predict planning risks in autonomous driving.
Findings
Attribution entropy correlates with trajectory error (Spearman 0.30).
Cross-camera Gini coefficient predicts collision risk (AUROC 0.77).
Risk signals generalize well to new scenes with minimal performance loss.
Abstract
End-to-end autonomous driving models generate future trajectories from multi-view inputs, improving system integration but introducing opaque decisions and hard-to-localize risks. Existing methods either rely on auxiliary monitoring models or generate textual explanations, but are decoupled from the planning process and fail to reveal the visual evidence underlying trajectory generation. While attribution offers a direct alternative, planning differs from image classification by taking six-view camera images as input and predicting continuous multi-step trajectories, requiring attribution to capture both critical views and regions and their influence on outputs. Moreover, whether attribution maps can support risk identification remains underexplored. To address this, we propose a hierarchical attribution framework for end-to-end planning. Specifically, using L2 consistency with the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
