In defense of local descriptor-based few-shot object detection
Shichao Zhou, Haoyan Li, Zhuowei Wang, Zekai Zhang

TL;DR
This paper proposes a learning-free few-shot object detection method using refined local descriptors and brain-inspired feature representations.
Contribution
A novel few-shot detection framework using spatial contextual attention and Kernel-InfoNCE loss to enhance local descriptors.
Findings
Local descriptors can be improved with spatial contextual attention for better global structure understanding.
The proposed method achieves effective few-shot detection without intensive training on remote sensing images.
The model uses non-parametric similarity computation for accelerated detection.
Abstract
State-of-the-art image object detection computational models require an intensive parameter fine-tuning stage (using deep convolution network, etc). with tens or hundreds of training examples. In contrast, human intelligence can robustly learn a new concept from just a few instances (i.e., few-shot detection). The distinctive perception mechanisms between these two families of systems enlighten us to revisit classical handcraft local descriptors (e.g., SIFT, HOG, etc.) as well as non-parametric visual models, which innately require no learning/training phase. Herein, we claim that the inferior performance of these local descriptors mainly results from a lack of global structure sense. To address this issue, we refine local descriptors with spatial contextual attention of neighbor affinities and then embed the local descriptors into discriminative subspace guided by Kernel-InfoNCE loss.…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning
