GSA-YOLO: A High-Efficiency Framework via Structured Sparsity and Adaptive Knowledge Distillation for Real-Time X-ray Security Inspection
Jiahao Kong

TL;DR
GSA-YOLO is a lightweight, high-efficiency detection framework for real-time X-ray security inspection, combining structured sparsity and adaptive knowledge distillation to improve accuracy and speed.
Contribution
It introduces a novel integration of structured sparsity and adaptive knowledge transfer into YOLOv8n for enhanced robustness and efficiency in security inspection tasks.
Findings
Achieves 189.62 FPS inference speed.
Reduces computational cost from 8.7G to 8.0G.
Improves mAP50:95 by 2.4% and 1.8% on two datasets.
Abstract
X-ray security inspection requires accurate real-time detection of prohibited items, but existing models often struggle to balance the challenges of severe occlusion, complex clutter, and strict speed requirements. To overcome these challenges, this paper proposes GSA-YOLO, a novel lightweight framework built upon the YOLOv8n architecture, specifically engineered to enhance detection robustness and inference efficiency. GSA-YOLO strategically integrates structured sparsity and adaptive knowledge transfer through three core components: Group Lasso (GL) applied to the network neck for robust feature extraction; Sparse Structure Selection (SSS) applied to the detection head for significant model slimming; and an Adaptive Knowledge Distillation (Ada-KD) mechanism for comprehensive accuracy recovery. This integrated approach synergistically enhances feature representation while pruning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
