Test-Time Training for Semantic Segmentation with Output Contrastive Loss
Yunlong Zhang, Yuxuan Sun, Sunyi Zheng, Zhongyi Shui and, Chenglu Zhu, Lin Yang

TL;DR
This paper introduces Output Contrastive Loss (OCL), a novel test-time training method for semantic segmentation that stabilizes adaptation and improves generalization to unseen environments by adapting contrastive loss to the output space.
Contribution
It adapts contrastive loss to the output space for test-time training in semantic segmentation, enhancing stability and generalization during online adaptation.
Findings
OCL improves segmentation performance across diverse scenarios.
The method is effective even with models pre-trained via domain adaptation.
OCL demonstrates robustness and adaptability in unseen environments.
Abstract
Although deep learning-based segmentation models have achieved impressive performance on public benchmarks, generalizing well to unseen environments remains a major challenge. To improve the model's generalization ability to the new domain during evaluation, the test-time training (TTT) is a challenging paradigm that adapts the source-pretrained model in an online fashion. Early efforts on TTT mainly focus on the image classification task. Directly extending these methods to semantic segmentation easily experiences unstable adaption due to segmentation's inherent characteristics, such as extreme class imbalance and complex decision spaces. To stabilize the adaptation process, we introduce contrastive loss (CL), known for its capability to learn robust and generalized representations. Nevertheless, the traditional CL operates in the representation space and cannot directly enhance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · COVID-19 diagnosis using AI
MethodsFocus
