Boosting Semantic Segmentation from the Perspective of Explicit Class   Embeddings

Yuhe Liu; Chuanjian Liu; Kai Han; Quan Tang; Zengchang Qin

arXiv:2308.12894·cs.CV·August 25, 2023·1 cites

Boosting Semantic Segmentation from the Perspective of Explicit Class Embeddings

Yuhe Liu, Chuanjian Liu, Kai Han, Quan Tang, Zengchang Qin

PDF

Open Access

TL;DR

This paper introduces ECENet, a novel semantic segmentation approach that explicitly generates and enhances class embeddings based on class masks, improving accuracy and efficiency by revisiting decoding and feature reconstruction processes.

Contribution

The paper proposes ECENet, a new segmentation paradigm that explicitly interacts with class masks to generate meaningful class embeddings and revisits decoding with inverted information flow.

Findings

01

ECENet outperforms existing methods on ADE20K with less computational cost.

02

Achieves state-of-the-art results on PASCAL-Context dataset.

03

Introduces a Feature Reconstruction module to enhance feature discriminability.

Abstract

Semantic segmentation is a computer vision task that associates a label with each pixel in an image. Modern approaches tend to introduce class embeddings into semantic segmentation for deeply utilizing category semantics, and regard supervised class masks as final predictions. In this paper, we explore the mechanism of class embeddings and have an insight that more explicit and meaningful class embeddings can be generated based on class masks purposely. Following this observation, we propose ECENet, a new segmentation paradigm, in which class embeddings are obtained and enhanced explicitly during interacting with multi-stage image features. Based on this, we revisit the traditional decoding process and explore inverted information flow between segmentation masks and class embeddings. Furthermore, to ensure the discriminability and informativity of features from backbone, we propose a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Digital Imaging for Blood Diseases · Multimodal Machine Learning Applications