RelationRS: Relationship Representation Network for Object Detection in Aerial Images
Zhiming Liu, Xuefei Zhang, Chongyang Liu, Hao Wang, Chao Sun, Bin Li,, Weifeng Sun, Pu Huang, Qingjun Li, Yu Liu, Haipeng Kuang, Jihong Xiu

TL;DR
RelationRS introduces a novel network that leverages scene semantics and multi-scale feature fusion to significantly improve object detection accuracy in complex aerial images.
Contribution
The paper proposes RelationRS, a relationship representation network with dual relationship and bridging visual modules for enhanced aerial image object detection.
Findings
Achieves state-of-the-art detection performance on aerial image datasets.
Effectively models relationships between different scene regions and scales.
Improves detection accuracy in complex backgrounds and scale variations.
Abstract
Object detection is a basic and important task in the field of aerial image processing and has gained much attention in computer vision. However, previous aerial image object detection approaches have insufficient use of scene semantic information between different regions of large-scale aerial images. In addition, complex background and scale changes make it difficult to improve detection accuracy. To address these issues, we propose a relationship representation network for object detection in aerial images (RelationRS): 1) Firstly, multi-scale features are fused and enhanced by a dual relationship module (DRM) with conditional convolution. The dual relationship module learns the potential relationship between features of different scales and learns the relationship between different scenes from different patches in a same iteration. In addition, the dual relationship module…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning
