Semantics-Guided Contrastive Network for Zero-Shot Object detection
Caixia Yan, Xiaojun Chang, Minnan Luo, Huan Liu, Xiaoqin Zhang, and, Qinghua Zheng

TL;DR
This paper introduces ContrastZSD, a novel semantics-guided contrastive network for zero-shot object detection that improves the discrimination and generalization to unseen categories by leveraging semantic supervision.
Contribution
The paper proposes a new contrastive learning framework for ZSD that incorporates semantic guidance to better utilize unseen class information and structure visual features.
Findings
Outperforms state-of-the-art on PASCAL VOC and MS COCO benchmarks.
Effectively reduces bias towards seen categories in ZSD.
Enhances visual-semantic alignment for improved detection of unseen objects.
Abstract
Zero-shot object detection (ZSD), the task that extends conventional detection models to detecting objects from unseen categories, has emerged as a new challenge in computer vision. Most existing approaches tackle the ZSD task with a strict mapping-transfer strategy, which may lead to suboptimal ZSD results: 1) the learning process of those models ignores the available unseen class information, and thus can be easily biased towards the seen categories; 2) the original visual feature space is not well-structured and lack of discriminative information. To address these issues, we develop a novel Semantics-Guided Contrastive Network for ZSD, named ContrastZSD, a detection framework that first brings contrastive learning mechanism into the realm of zero-shot detection. Particularly, ContrastZSD incorporates two semantics-guided contrastive learning subnets that contrast between…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsContrastive Learning
