Zero-Shot Aerial Object Detection with Visual Description Regularization
Zhengqing Zang, Chenyu Lin, Chenwei Tang, Tao Wang, Jiancheng Lv

TL;DR
This paper introduces DescReg, a zero-shot aerial object detection method that leverages visual description regularization to improve detection of unseen classes without extensive annotations.
Contribution
It proposes a novel similarity-aware triplet loss to incorporate prior visual similarities from descriptions, addressing the semantic-visual correlation gap in zero-shot detection.
Findings
DescReg outperforms state-of-the-art ZSD methods on DIOR, xView, and DOTA datasets.
It achieves 4.5 mAP improvement on unseen classes in DIOR.
The method generalizes well across different detection architectures.
Abstract
Existing object detection models are mainly trained on large-scale labeled datasets. However, annotating data for novel aerial object classes is expensive since it is time-consuming and may require expert knowledge. Thus, it is desirable to study label-efficient object detection methods on aerial images. In this work, we propose a zero-shot method for aerial object detection named visual Description Regularization, or DescReg. Concretely, we identify the weak semantic-visual correlation of the aerial objects and aim to address the challenge with prior descriptions of their visual appearance. Instead of directly encoding the descriptions into class embedding space which suffers from the representation gap problem, we propose to infuse the prior inter-class visual similarity conveyed in the descriptions into the embedding learning. The infusion process is accomplished with a newly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsInfrared Target Detection Methodologies · Robotics and Sensor-Based Localization · Advanced Image and Video Retrieval Techniques
MethodsTriplet Loss
