Efficient Meta-Learning Enabled Lightweight Multiscale Few-Shot Object Detection in Remote Sensing Images
Wenbin Guan, Zijiu Yang, Xiaohong Wu, Liqiong Chen, Feng Huang,, Xiaohai He, Honggang Chen

TL;DR
This paper introduces a lightweight, meta-learning based multiscale few-shot object detection method for remote sensing images, improving accuracy and efficiency over existing models by leveraging a novel training framework and sample strategy.
Contribution
It proposes a novel meta-learning framework for lightweight one-stage detectors, specifically YOLOv7, tailored for multiscale few-shot object detection in remote sensing images.
Findings
Achieved superior detection accuracy on DIOR and NWPU VHR-10.v2 datasets.
Enhanced detection speed and model efficiency compared to traditional two-stage detectors.
Effectively utilized negative samples to improve meta-learning performance.
Abstract
Presently, the task of few-shot object detection (FSOD) in remote sensing images (RSIs) has become a focal point of attention. Numerous few-shot detectors, particularly those based on two-stage detectors, face challenges when dealing with the multiscale complexities inherent in RSIs. Moreover, these detectors present impractical characteristics in real-world applications, mainly due to their unwieldy model parameters when handling large amount of data. In contrast, we recognize the advantages of one-stage detectors, including high detection speed and a global receptive field. Consequently, we choose the YOLOv7 one-stage detector as a baseline and subject it to a novel meta-learning training framework. This transformation allows the detector to adeptly address FSOD tasks while capitalizing on its inherent advantage of lightweight. Additionally, we thoroughly investigate the samples…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRemote-Sensing Image Classification · Image Processing Techniques and Applications · Domain Adaptation and Few-Shot Learning
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
