Cross-Camera Distracted Driver Classification through Feature Disentanglement and Contrastive Learning
Luigi Celona, Simone Bianco, Paolo Napoletano

TL;DR
This paper presents DBMNet, a robust driver distraction classification model that uses feature disentanglement and contrastive learning to improve generalization across different camera views and conditions.
Contribution
The paper introduces a lightweight, camera-view-invariant model with a disentanglement module and contrastive learning, achieving superior accuracy and deployment efficiency.
Findings
DBMNet improves Top-1 accuracy by 7% over existing methods.
It generalizes well across different datasets and camera views.
The model is efficient enough for deployment on edge devices.
Abstract
The classification of distracted drivers is pivotal for ensuring safe driving. Previous studies demonstrated the effectiveness of neural networks in automatically predicting driver distraction, fatigue, and potential hazards. However, recent research has uncovered a significant loss of accuracy in these models when applied to samples acquired under conditions that differ from the training data. In this paper, we introduce a robust model designed to withstand changes in camera position within the vehicle. Our Driver Behavior Monitoring Network (DBMNet) relies on a lightweight backbone and integrates a disentanglement module to discard camera view information from features, coupled with contrastive learning to enhance the encoding of various driver actions. Experiments conducted using a leave-one-camera-out protocol on the daytime and nighttime subsets of the 100-Driver dataset validate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
