TL;DR
This paper introduces a universal class-aware regularization method for semantic segmentation that improves feature learning by explicitly optimizing class representations, leading to significant accuracy gains across models.
Contribution
It proposes a novel regularization approach with three loss functions that enhance intra-class compactness and inter-class separation, directly using ground truth class centers for better feature learning.
Findings
Boosts segmentation accuracy by up to 2.23% mIOU.
Applicable to various models like OCR and CPNet without extra inference cost.
Demonstrates superior generalization across multiple datasets.
Abstract
Recent segmentation methods, such as OCR and CPNet, utilizing "class level" information in addition to pixel features, have achieved notable success for boosting the accuracy of existing network modules. However, the extracted class-level information was simply concatenated to pixel features, without explicitly being exploited for better pixel representation learning. Moreover, these approaches learn soft class centers based on coarse mask prediction, which is prone to error accumulation. In this paper, aiming to use class level information more effectively, we propose a universal Class-Aware Regularization (CAR) approach to optimize the intra-class variance and inter-class distance during feature learning, motivated by the fact that humans can recognize an object by itself no matter which other objects it appears with. Three novel loss functions are proposed. The first loss function…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
