SOAR: Advancements in Small Body Object Detection for Aerial Imagery Using State Space Models and Programmable Gradients
Tushar Verma, Jyotsna Singh, Yash Bhartari, Rishi Jarwal, Suraj Singh,, Shubhkarman Singh

TL;DR
This paper introduces innovative lightweight models and frameworks, including Programmable Gradient Information and a bidirectional State Space Model, to significantly improve small object detection accuracy and efficiency in aerial imagery.
Contribution
The paper presents a novel combination of the SAHI framework with YOLO v9 using PGI, and a new bidirectional State Space Model, advancing small object detection in aerial images.
Findings
Enhanced detection accuracy in aerial imagery
Improved processing efficiency for real-time applications
Validation of methods across diverse scenarios
Abstract
Small object detection in aerial imagery presents significant challenges in computer vision due to the minimal data inherent in small-sized objects and their propensity to be obscured by larger objects and background noise. Traditional methods using transformer-based models often face limitations stemming from the lack of specialized databases, which adversely affect their performance with objects of varying orientations and scales. This underscores the need for more adaptable, lightweight models. In response, this paper introduces two innovative approaches that significantly enhance detection and segmentation capabilities for small aerial objects. Firstly, we explore the use of the SAHI framework on the newly introduced lightweight YOLO v9 architecture, which utilizes Programmable Gradient Information (PGI) to reduce the substantial information loss typically encountered in sequential…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInfrared Target Detection Methodologies · Advanced Neural Network Applications · Video Surveillance and Tracking Methods
