Interpretable Self-Attention Temporal Reasoning for Driving Behavior Understanding
Yi-Chieh Liu, Yung-An Hsieh, Min-Hung Chen, Chao-Han Huck Yang, Jesper, Tegner, Yi-Chang James Tsai

TL;DR
This paper introduces a Temporal Reasoning Block (TRB) to enhance 3D CNNs for driving behavior classification, enabling models to better focus on causal factors like traffic lights, thus improving accuracy and interpretability.
Contribution
The paper proposes the TRB module that improves causal reasoning in 3D CNNs for driving behavior prediction, with enhanced accuracy and interpretability over existing models.
Findings
TRB models achieved 86.3% accuracy, outperforming previous methods.
Attention saliency analysis showed TRB helps models focus on causes.
Visual explanations confirmed improved causal focus with TRB.
Abstract
Performing driving behaviors based on causal reasoning is essential to ensure driving safety. In this work, we investigated how state-of-the-art 3D Convolutional Neural Networks (CNNs) perform on classifying driving behaviors based on causal reasoning. We proposed a perturbation-based visual explanation method to inspect the models' performance visually. By examining the video attention saliency, we found that existing models could not precisely capture the causes (e.g., traffic light) of the specific action (e.g., stopping). Therefore, the Temporal Reasoning Block (TRB) was proposed and introduced to the models. With the TRB models, we achieved the accuracy of , which outperform the state-of-the-art 3D CNNs from previous works. The attention saliency also demonstrated that TRB helped models focus on the causes more precisely. With both numerical and visual evaluations,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
