Better YOLO with Attention-Augmented Network and Enhanced Generalization   Performance for Safety Helmet Detection

Shuqi Shen; Junjie Yang

arXiv:2405.02591·cs.CV·May 7, 2024

Better YOLO with Attention-Augmented Network and Enhanced Generalization Performance for Safety Helmet Detection

Shuqi Shen, Junjie Yang

PDF

Open Access

TL;DR

This paper introduces an improved YOLO-based safety helmet detection model that combines GhostNetv2, attention modules, and a specialized optimizer to enhance accuracy, efficiency, and adaptability in complex environments.

Contribution

It presents a novel lightweight network with attention mechanisms and a new optimizer, achieving higher accuracy and better generalization for helmet detection.

Findings

01

2% improvement in mAP over baseline

02

Reduced parameters and Flops by over 25%

03

Enhanced robustness and speed in complex scenes

Abstract

Safety helmets play a crucial role in protecting workers from head injuries in construction sites, where potential hazards are prevalent. However, currently, there is no approach that can simultaneously achieve both model accuracy and performance in complex environments. In this study, we utilized a Yolo-based model for safety helmet detection, achieved a 2% improvement in mAP (mean Average Precision) performance while reducing parameters and Flops count by over 25%. YOLO(You Only Look Once) is a widely used, high-performance, lightweight model architecture that is well suited for complex environments. We presents a novel approach by incorporating a lightweight feature extraction network backbone based on GhostNetv2, integrating attention modules such as Spatial Channel-wise Attention Net(SCNet) and Coordination Attention Net(CANet), and adopting the Gradient Norm Aware optimizer (GAM)…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTraffic and Road Safety · Fire Detection and Safety Systems · Injury Epidemiology and Prevention

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Generalized additive models · Attentive Walk-Aggregating Graph Neural Network