Learning to Reduce Information Bottleneck for Object Detection in Aerial Images
Yuchen Shen, Dong Zhang, Zhihao Song, Xuesong Jiang, Qiaolin Ye

TL;DR
This paper introduces GSNet, a novel global semantic network with a feature fusion module to improve object detection in aerial images by addressing information bottleneck and semantic gap issues, achieving better accuracy with less computation.
Contribution
The paper proposes GSNet and FRM, pioneering the use of neck networks to reduce information bottleneck in aerial image object detection.
Findings
Improved detection accuracy on DOTA and HRSC2016 datasets.
Reduced computational costs compared to existing methods.
Effective feature fusion addressing semantic gaps.
Abstract
Object detection in aerial images is a fundamental research topic in the geoscience and remote sensing domain. However, the advanced approaches on this topic mainly focus on designing the elaborate backbones or head networks but ignore neck networks. In this letter, we first underline the importance of the neck network in object detection from the perspective of information bottleneck. Then, to alleviate the information deficiency problem in the current approaches, we propose a global semantic network (GSNet), which acts as a bridge from the backbone network to the head network in a bidirectional global pattern. Compared to the existing approaches, our model can capture the rich and enhanced image features with less computational costs. Besides, we further propose a feature fusion refinement module (FRM) for different levels of features, which are suffering from the problem of semantic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Video Surveillance and Tracking Methods
MethodsConvolution
