Better YOLO with Attention-Augmented Network and Enhanced Generalization Performance for Safety Helmet Detection
Shuqi Shen, Junjie Yang

TL;DR
This paper introduces an improved YOLO-based safety helmet detection model that combines GhostNetv2, attention modules, and a specialized optimizer to enhance accuracy, efficiency, and adaptability in complex environments.
Contribution
It presents a novel lightweight network with attention mechanisms and a new optimizer, achieving higher accuracy and better generalization for helmet detection.
Findings
2% improvement in mAP over baseline
Reduced parameters and Flops by over 25%
Enhanced robustness and speed in complex scenes
Abstract
Safety helmets play a crucial role in protecting workers from head injuries in construction sites, where potential hazards are prevalent. However, currently, there is no approach that can simultaneously achieve both model accuracy and performance in complex environments. In this study, we utilized a Yolo-based model for safety helmet detection, achieved a 2% improvement in mAP (mean Average Precision) performance while reducing parameters and Flops count by over 25%. YOLO(You Only Look Once) is a widely used, high-performance, lightweight model architecture that is well suited for complex environments. We presents a novel approach by incorporating a lightweight feature extraction network backbone based on GhostNetv2, integrating attention modules such as Spatial Channel-wise Attention Net(SCNet) and Coordination Attention Net(CANet), and adopting the Gradient Norm Aware optimizer (GAM)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTraffic and Road Safety · Fire Detection and Safety Systems · Injury Epidemiology and Prevention
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Generalized additive models · Attentive Walk-Aggregating Graph Neural Network
