TransDARC: Transformer-based Driver Activity Recognition with Latent Space Feature Calibration
Kunyu Peng, Alina Roitberg, Kailun Yang, Jiaming Zhang, Rainer, Stiefelhagen

TL;DR
TransDARC introduces a transformer-based driver activity recognition framework with a novel feature calibration module that enhances generalization across different sensor setups and improves recognition accuracy in vehicle environments.
Contribution
It proposes a new vision-based transformer framework with latent space feature calibration for driver activity recognition, addressing sensor variability and improving generalization.
Findings
Outperforms previous state-of-the-art on Drive&Act benchmark
Improves recognition accuracy across all activity granularity levels
Enhances model robustness to sensor and vehicle variations
Abstract
Traditional video-based human activity recognition has experienced remarkable progress linked to the rise of deep learning, but this effect was slower as it comes to the downstream task of driver behavior understanding. Understanding the situation inside the vehicle cabin is essential for Advanced Driving Assistant System (ADAS) as it enables identifying distraction, predicting driver's intent and leads to more convenient human-vehicle interaction. At the same time, driver observation systems face substantial obstacles as they need to capture different granularities of driver states, while the complexity of such secondary activities grows with the rising automation and increased driver freedom. Furthermore, a model is rarely deployed under conditions identical to the ones in the training set, as sensor placements and types vary from vehicle to vehicle, constituting a substantial…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSleep and Work-Related Fatigue · Human-Automation Interaction and Safety · Context-Aware Activity Recognition Systems
