Spatio-Temporal Pixel-Level Contrastive Learning-based Source-Free Domain Adaptation for Video Semantic Segmentation
Shao-Yuan Lo, Poojan Oza, Sumanth Chennupati, Alejandro Galindo,, Vishal M. Patel

TL;DR
This paper introduces a novel spatio-temporal pixel-level contrastive learning method for source-free video semantic segmentation, leveraging temporal information to improve adaptation without source data, achieving state-of-the-art results.
Contribution
It proposes the first spatio-temporal pixel-level contrastive learning approach for source-free video semantic segmentation, enhancing adaptation by exploiting temporal correlations.
Findings
Achieves state-of-the-art performance on VSS benchmarks.
Outperforms existing UDA and SFDA methods in video segmentation.
Effectively leverages spatio-temporal information for domain adaptation.
Abstract
Unsupervised Domain Adaptation (UDA) of semantic segmentation transfers labeled source knowledge to an unlabeled target domain by relying on accessing both the source and target data. However, the access to source data is often restricted or infeasible in real-world scenarios. Under the source data restrictive circumstances, UDA is less practical. To address this, recent works have explored solutions under the Source-Free Domain Adaptation (SFDA) setup, which aims to adapt a source-trained model to the target domain without accessing source data. Still, existing SFDA approaches use only image-level information for adaptation, making them sub-optimal in video applications. This paper studies SFDA for Video Semantic Segmentation (VSS), where temporal information is leveraged to address video adaptation. Specifically, we propose Spatio-Temporal Pixel-Level (STPL) contrastive learning, a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Advanced Neural Network Applications
