Context Label Learning: Improving Background Class Representations in Semantic Segmentation
Zeju Li, Konstantinos Kamnitsas, Cheng Ouyang, Chen Chen, Ben, Glocker

TL;DR
This paper introduces CoLab, a method that decomposes the background class into subclasses using an auxiliary network, enhancing background representation and significantly improving semantic segmentation accuracy.
Contribution
The paper proposes a novel auxiliary network approach to automatically generate context labels, addressing background heterogeneity in segmentation tasks.
Findings
CoLab improves segmentation accuracy across multiple datasets.
It helps background logits stay away from decision boundaries.
Enhanced background class representation reduces over-segmentation.
Abstract
Background samples provide key contextual information for segmenting regions of interest (ROIs). However, they always cover a diverse set of structures, causing difficulties for the segmentation model to learn good decision boundaries with high sensitivity and precision. The issue concerns the highly heterogeneous nature of the background class, resulting in multi-modal distributions. Empirically, we find that neural networks trained with heterogeneous background struggle to map the corresponding contextual samples to compact clusters in feature space. As a result, the distribution over background logit activations may shift across the decision boundary, leading to systematic over-segmentation across different datasets and tasks. In this study, we propose context label learning (CoLab) to improve the context representations by decomposing the background class into several subclasses.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications
