Query Adaptive Few-Shot Object Detection with Heterogeneous Graph Convolutional Networks
Guangxing Han, Yicheng He, Shiyuan Huang, Jiawei Ma, Shih-Fu Chang

TL;DR
This paper introduces QA-FewDet, a novel few-shot object detection model using heterogeneous graph convolutional networks to better model relationships among proposals and classes, leading to improved detection accuracy.
Contribution
The paper proposes a new FSOD approach leveraging heterogeneous graph convolutional networks for context-aware features and query-adaptive prototypes, enhancing pairwise matching.
Findings
Outperforms state-of-the-art on PASCAL VOC and MSCOCO benchmarks.
Effective message passing among proposals and classes improves detection accuracy.
Robust across different shots and evaluation metrics.
Abstract
Few-shot object detection (FSOD) aims to detect never-seen objects using few examples. This field sees recent improvement owing to the meta-learning techniques by learning how to match between the query image and few-shot class examples, such that the learned model can generalize to few-shot novel classes. However, currently, most of the meta-learning-based methods perform pairwise matching between query image regions (usually proposals) and novel classes separately, therefore failing to take into account multiple relationships among them. In this paper, we propose a novel FSOD model using heterogeneous graph convolutional networks. Through efficient message passing among all the proposal and class nodes with three different types of edges, we could obtain context-aware proposal features and query-adaptive, multiclass-enhanced prototype representations for each class, which could help…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Multimodal Machine Learning Applications
