Research on target detection method of distracted driving behavior based   on improved YOLOv8

Shiquan Shen; Zhizhong Wu; Pan Zhang

arXiv:2407.01864·cs.CV·July 8, 2024

Research on target detection method of distracted driving behavior based on improved YOLOv8

Shiquan Shen, Zhizhong Wu, Pan Zhang

PDF

Open Access

TL;DR

This paper presents an enhanced YOLOv8-based method for real-time detection of distracted driving behaviors, improving accuracy and efficiency through model modifications.

Contribution

The study introduces an improved YOLOv8 model incorporating BoTNet, GAM attention, and EIoU loss, optimizing feature extraction and fusion for better performance.

Findings

01

Detection accuracy of 99.4% achieved

02

Model size reduced for easier deployment

03

Real-time detection and classification demonstrated

Abstract

With the development of deep learning technology, the detection and classification of distracted driving behaviour requires higher accuracy. Existing deep learning-based methods are computationally intensive and parameter redundant, limiting the efficiency and accuracy in practical applications. To solve this problem, this study proposes an improved YOLOv8 detection method based on the original YOLOv8 model by integrating the BoTNet module, GAM attention mechanism and EIoU loss function. By optimising the feature extraction and multi-scale feature fusion strategies, the training and inference processes are simplified, and the detection accuracy and efficiency are significantly improved. Experimental results show that the improved model performs well in both detection speed and accuracy, with an accuracy rate of 99.4%, and the model is smaller and easy to deploy, which is able to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVideo Surveillance and Tracking Methods

MethodsSoftmax · Attention Is All You Need · You Only Look Once · Generalized additive models · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings