Semantic Segmentation via Pixel-to-Center Similarity Calculation
Dongyue Wu, Zilin Guo, Aoyan Li, Changqian Yu, Changxin Gao, Nong Sang

TL;DR
This paper introduces a novel similarity-based approach for semantic segmentation that adaptively generates class centers conditioned on scenes, effectively reducing intra-class variation and enhancing inter-class distinction.
Contribution
The paper proposes the CCS layer with ACCM to generate adaptive class centers and new loss functions, improving segmentation accuracy over existing CNN-based methods.
Findings
Outperforms state-of-the-art CNN-based methods in experiments.
Effectively reduces intra-class variation across different scenes.
Enhances inter-class distinction in pixel classification.
Abstract
Since the fully convolutional network has achieved great success in semantic segmentation, lots of works have been proposed focusing on extracting discriminative pixel feature representations. However, we observe that existing methods still suffer from two typical challenges, i.e. (i) large intra-class feature variation in different scenes, (ii) small inter-class feature distinction in the same scene. In this paper, we first rethink semantic segmentation from a perspective of similarity between pixels and class centers. Each weight vector of the segmentation head represents its corresponding semantic class in the whole dataset, which can be regarded as the embedding of the class center. Thus, the pixel-wise classification amounts to computing similarity in the final feature space between pixels and the class centers. Under this novel view, we propose a Class Center Similarity layer (CCS…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning
