UniT: Unified Knowledge Transfer for Any-shot Object Detection and   Segmentation

Siddhesh Khandelwal; Raghav Goyal; Leonid Sigal

arXiv:2006.07502·cs.CV·March 5, 2021

UniT: Unified Knowledge Transfer for Any-shot Object Detection and Segmentation

Siddhesh Khandelwal, Raghav Goyal, Leonid Sigal

PDF

TL;DR

UniT introduces a unified semi-supervised framework that effectively transfers knowledge across different levels of supervision for object detection and segmentation, enabling robust performance from zero to few-shot scenarios.

Contribution

It proposes an end-to-end trainable model that bridges weakly-supervised and fully-supervised methods, leveraging class similarities for improved novel class detection and segmentation.

Findings

01

Enhanced performance across zero to few-shot settings

02

Effective knowledge transfer between base and novel classes

03

Consistent improvements on MS-COCO and Pascal VOC datasets

Abstract

Methods for object detection and segmentation rely on large scale instance-level annotations for training, which are difficult and time-consuming to collect. Efforts to alleviate this look at varying degrees and quality of supervision. Weakly-supervised approaches draw on image-level labels to build detectors/segmentors, while zero/few-shot methods assume abundant instance-level data for a set of base classes, and none to a few examples for novel classes. This taxonomy has largely siloed algorithmic designs. In this work, we aim to bridge this divide by proposing an intuitive and unified semi-supervised model that is applicable to a range of supervision: from zero to a few instance-level samples per novel class. For base classes, our model learns a mapping from weakly-supervised to fully-supervised detectors/segmentors. By learning and leveraging visual and lingual similarities between…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.