A Novel Driver Distraction Behavior Detection Method Based on Self-supervised Learning with Masked Image Modeling
Yingzhi Zhang, Taiguo Li, Chao Li, Xinghong Zhou

TL;DR
This paper introduces a self-supervised learning approach using masked image modeling and Swin Transformer to detect driver distraction behaviors, reducing reliance on labeled datasets and achieving high accuracy.
Contribution
It proposes a novel self-supervised framework with masked image modeling and Swin Transformer for driver distraction detection, improving generalization and reducing labeling costs.
Findings
Achieved 99.60% accuracy on a large-scale dataset.
Outperformed traditional supervised methods in recognition ability.
Enhanced model efficiency through reconfigured Swin Transformer architecture.
Abstract
Driver distraction causes a significant number of traffic accidents every year, resulting in economic losses and casualties. Currently, the level of automation in commercial vehicles is far from completely unmanned, and drivers still play an important role in operating and controlling the vehicle. Therefore, driver distraction behavior detection is crucial for road safety. At present, driver distraction detection primarily relies on traditional convolutional neural networks (CNN) and supervised learning methods. However, there are still challenges such as the high cost of labeled datasets, limited ability to capture high-level semantic information, and weak generalization performance. In order to solve these problems, this paper proposes a new self-supervised learning method based on masked image modeling for driver distraction behavior detection. Firstly, a self-supervised learning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEEG and Brain-Computer Interfaces · Human-Automation Interaction and Safety
MethodsMulti-Head Attention · Attention Is All You Need · Test · Residual Connection · Linear Layer · Stochastic Depth · Layer Normalization · Byte Pair Encoding · Softmax · Label Smoothing
