Semantic Relation Reasoning for Shot-Stable Few-Shot Object Detection

Chenchen Zhu; Fangyi Chen; Uzair Ahmed; Zhiqiang Shen; Marios Savvides

arXiv:2103.01903·cs.CV·March 23, 2021·22 cites

Semantic Relation Reasoning for Shot-Stable Few-Shot Object Detection

Chenchen Zhu, Fangyi Chen, Uzair Ahmed, Zhiqiang Shen, Marios Savvides

PDF

Open Access

TL;DR

This paper introduces SRR-FSD, a novel few-shot object detection method that leverages semantic relation reasoning with dynamic graphs, improving stability and performance especially in low-shot scenarios.

Contribution

It proposes a relation reasoning framework combining semantic embeddings and dynamic graphs to enhance few-shot object detection robustness.

Findings

01

SRR-FSD achieves competitive results at higher shots.

02

It significantly outperforms in low-shot settings.

03

The benchmark protocol with implicit shots offers a more realistic evaluation.

Abstract

Few-shot object detection is an imperative and long-lasting problem due to the inherent long-tail distribution of real-world data. Its performance is largely affected by the data scarcity of novel classes. But the semantic relation between the novel classes and the base classes is constant regardless of the data availability. In this work, we investigate utilizing this semantic relation together with the visual information and introduce explicit relation reasoning into the learning of novel object detection. Specifically, we represent each class concept by a semantic embedding learned from a large corpus of text. The detector is trained to project the image representations of objects into this embedding space. We also identify the problems of trivially using the raw embeddings with a heuristic knowledge graph and propose to augment the embeddings with a dynamic relation graph. As a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Advanced Neural Network Applications