EABI-DETR: An Efficient Aerial Small Object Detection Network
Fufang Li, Yuehua Zhang, Yuxuan Fan

TL;DR
This paper introduces EABI-DETR, a new model for detecting small objects in aerial images, which improves performance and efficiency compared to existing methods.
Contribution
The novel EABI-DETR model combines a lightweight backbone, bi-level feature fusion, and a new loss function for efficient aerial small object detection.
Findings
EABI-DETR outperforms RT-DETR by 6.2% and 5.1% on [email protected] and [email protected]:0.95 metrics.
The model maintains high inference efficiency while improving detection accuracy for small aerial objects.
Bi-level feature fusion and lightweight attention mechanisms enhance perception of small objects in complex aerial scenes.
Abstract
Small object detection, as an important research topic in computer vision, has been widely applied in aerial visual tasks such as remote sensing and UAV imagery. However, due to challenges such as small object size, large-scale variations, and complex backgrounds, existing detection models often struggle to capture fine-grained semantics and high-resolution texture information in aerial scenes, leading to limited performance. To address these issues, this paper proposes an efficient aerial small object detection model, EABI-DETR (Efficient Attention and Bi-level Integration DETR), based on the RT-DETR framework. The proposed model introduces systematic enhancements from three aspects: (1) A lightweight backbone network, C2f-EMA, is developed by integrating the C2f structure with an efficient multi-scale attention (EMA) mechanism. This design jointly models channel semantics and spatial…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications
