Semantics-Guided Contrastive Network for Zero-Shot Object detection

Caixia Yan; Xiaojun Chang; Minnan Luo; Huan Liu; Xiaoqin Zhang; and; Qinghua Zheng

arXiv:2109.06062·cs.CV·January 4, 2022

Semantics-Guided Contrastive Network for Zero-Shot Object detection

Caixia Yan, Xiaojun Chang, Minnan Luo, Huan Liu, Xiaoqin Zhang, and, Qinghua Zheng

PDF

TL;DR

This paper introduces ContrastZSD, a novel semantics-guided contrastive network for zero-shot object detection that improves the discrimination and generalization to unseen categories by leveraging semantic supervision.

Contribution

The paper proposes a new contrastive learning framework for ZSD that incorporates semantic guidance to better utilize unseen class information and structure visual features.

Findings

01

Outperforms state-of-the-art on PASCAL VOC and MS COCO benchmarks.

02

Effectively reduces bias towards seen categories in ZSD.

03

Enhances visual-semantic alignment for improved detection of unseen objects.

Abstract

Zero-shot object detection (ZSD), the task that extends conventional detection models to detecting objects from unseen categories, has emerged as a new challenge in computer vision. Most existing approaches tackle the ZSD task with a strict mapping-transfer strategy, which may lead to suboptimal ZSD results: 1) the learning process of those models ignores the available unseen class information, and thus can be easily biased towards the seen categories; 2) the original visual feature space is not well-structured and lack of discriminative information. To address these issues, we develop a novel Semantics-Guided Contrastive Network for ZSD, named ContrastZSD, a detection framework that first brings contrastive learning mechanism into the realm of zero-shot detection. Particularly, ContrastZSD incorporates two semantics-guided contrastive learning subnets that contrast between…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsContrastive Learning