Recursive Training for Zero-Shot Semantic Segmentation

Ce Wang; Moshiur Farazi; Nick Barnes

arXiv:2103.00086·cs.CV·October 6, 2021

Recursive Training for Zero-Shot Semantic Segmentation

Ce Wang, Moshiur Farazi, Nick Barnes

PDF

TL;DR

This paper introduces a recursive training method with a novel ZS-MMD loss to improve zero-shot semantic segmentation, enabling models to better identify unseen classes with state-of-the-art results.

Contribution

It proposes a recursive training scheme using pseudo-feature supervision and ZS-MMD loss to enhance discriminative features for unseen classes in zero-shot segmentation.

Findings

01

Achieves state-of-the-art results on Pascal-VOC 2012.

02

Outperforms previous models in zero-shot segmentation accuracy.

03

Provides a new training paradigm for zero-shot semantic segmentation.

Abstract

General purpose semantic segmentation relies on a backbone CNN network to extract discriminative features that help classify each image pixel into a 'seen' object class (ie., the object classes available during training) or a background class. Zero-shot semantic segmentation is a challenging task that requires a computer vision model to identify image pixels belonging to an object class which it has never seen before. Equipping a general purpose semantic segmentation model to separate image pixels of 'unseen' classes from the background remains an open challenge. Some recent models have approached this problem by fine-tuning the final pixel classification layer of a semantic segmentation model for a Zero-Shot setting, but struggle to learn discriminative features due to the lack of supervision. We propose a recursive training scheme to supervise the retraining of a semantic segmentation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.