Experience feedback using Representation Learning for Few-Shot Object Detection on Aerial Images
Pierre Le Jeune, Mustapha Lebbah, Anissa Mokraoui, Hanene Azzag

TL;DR
This paper introduces a few-shot object detection approach for aerial images using Faster R-CNN with prototypical networks, enabling online adaptation to new classes through representation learning and episodic training.
Contribution
It presents a novel few-shot detection method combining Faster R-CNN and prototypical networks, with training strategies tailored for aerial image datasets.
Findings
Effective adaptation to new classes demonstrated on DOTA dataset
Identified intrinsic weaknesses in few-shot object detection tasks
Proposed strategies improve detection performance in limited data scenarios
Abstract
This paper proposes a few-shot method based on Faster R-CNN and representation learning for object detection in aerial images. The two classification branches of Faster R-CNN are replaced by prototypical networks for online adaptation to new classes. These networks produce embeddings vectors for each generated box, which are then compared with class prototypes. The distance between an embedding and a prototype determines the corresponding classification score. The resulting networks are trained in an episodic manner. A new detection task is randomly sampled at each epoch, consisting in detecting only a subset of the classes annotated in the dataset. This training strategy encourages the network to adapt to new classes as it would at test time. In addition, several ideas are explored to improve the proposed method such as a hard negative examples mining strategy and self-supervised…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques
MethodsTest · Softmax · Convolution · RoIPool · Region Proposal Network · Faster R-CNN
