A Novel Driver Distraction Behavior Detection Method Based on   Self-supervised Learning with Masked Image Modeling

Yingzhi Zhang; Taiguo Li; Chao Li; Xinghong Zhou

arXiv:2306.00543·cs.CV·July 14, 2023·1 cites

A Novel Driver Distraction Behavior Detection Method Based on Self-supervised Learning with Masked Image Modeling

Yingzhi Zhang, Taiguo Li, Chao Li, Xinghong Zhou

PDF

Open Access 1 Repo

TL;DR

This paper introduces a self-supervised learning approach using masked image modeling and Swin Transformer to detect driver distraction behaviors, reducing reliance on labeled datasets and achieving high accuracy.

Contribution

It proposes a novel self-supervised framework with masked image modeling and Swin Transformer for driver distraction detection, improving generalization and reducing labeling costs.

Findings

01

Achieved 99.60% accuracy on a large-scale dataset.

02

Outperformed traditional supervised methods in recognition ability.

03

Enhanced model efficiency through reconfigured Swin Transformer architecture.

Abstract

Driver distraction causes a significant number of traffic accidents every year, resulting in economic losses and casualties. Currently, the level of automation in commercial vehicles is far from completely unmanned, and drivers still play an important role in operating and controlling the vehicle. Therefore, driver distraction behavior detection is crucial for road safety. At present, driver distraction detection primarily relies on traditional convolutional neural networks (CNN) and supervised learning methods. However, there are still challenges such as the high cost of labeled datasets, limited ability to capture high-level semantic information, and weak generalization performance. In order to solve these problems, this paper proposes a new self-supervised learning method based on masked image modeling for driver distraction behavior detection. Firstly, a self-supervised learning…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

rocky1salady-killer/sl-ddbd
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEEG and Brain-Computer Interfaces · Human-Automation Interaction and Safety

MethodsMulti-Head Attention · Attention Is All You Need · Test · Residual Connection · Linear Layer · Stochastic Depth · Layer Normalization · Byte Pair Encoding · Softmax · Label Smoothing