SOD-YOLOv8 -- Enhancing YOLOv8 for Small Object Detection in Traffic Scenes
Boshra Khalili, Andrew W.Smyth

TL;DR
SOD-YOLOv8 is a specialized model that improves small object detection in traffic scenes by enhancing feature fusion, adding a detection layer, and introducing new IoU metrics, achieving higher accuracy without significant computational costs.
Contribution
The paper introduces SOD-YOLOv8, a novel small object detection model that integrates multi-scale features, a new detection layer, and a custom IoU to improve accuracy in traffic scenarios.
Findings
Increases recall from 40.1% to 43.9%.
Improves [email protected] from 40.6% to 45.1%.
Enhances detection reliability in real-world traffic scenes.
Abstract
Object detection as part of computer vision can be crucial for traffic management, emergency response, autonomous vehicles, and smart cities. Despite significant advances in object detection, detecting small objects in images captured by distant cameras remains challenging due to their size, distance from the camera, varied shapes, and cluttered backgrounds. To address these challenges, we propose Small Object Detection YOLOv8 (SOD-YOLOv8), a novel model specifically designed for scenarios involving numerous small objects. Inspired by Efficient Generalized Feature Pyramid Networks (GFPN), we enhance multi-path fusion within YOLOv8 to integrate features across different levels, preserving details from shallower layers and improving small object detection accuracy. Also, A fourth detection layer is added to leverage high-resolution spatial information effectively. The Efficient…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Industrial Vision Systems and Defect Detection
MethodsSoftmax · Attention Is All You Need · You Only Look Once
