Spatio-Temporal Pixel-Level Contrastive Learning-based Source-Free   Domain Adaptation for Video Semantic Segmentation

Shao-Yuan Lo; Poojan Oza; Sumanth Chennupati; Alejandro Galindo,; Vishal M. Patel

arXiv:2303.14361·cs.CV·March 28, 2023·1 cites

Spatio-Temporal Pixel-Level Contrastive Learning-based Source-Free Domain Adaptation for Video Semantic Segmentation

Shao-Yuan Lo, Poojan Oza, Sumanth Chennupati, Alejandro Galindo,, Vishal M. Patel

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel spatio-temporal pixel-level contrastive learning method for source-free video semantic segmentation, leveraging temporal information to improve adaptation without source data, achieving state-of-the-art results.

Contribution

It proposes the first spatio-temporal pixel-level contrastive learning approach for source-free video semantic segmentation, enhancing adaptation by exploiting temporal correlations.

Findings

01

Achieves state-of-the-art performance on VSS benchmarks.

02

Outperforms existing UDA and SFDA methods in video segmentation.

03

Effectively leverages spatio-temporal information for domain adaptation.

Abstract

Unsupervised Domain Adaptation (UDA) of semantic segmentation transfers labeled source knowledge to an unlabeled target domain by relying on accessing both the source and target data. However, the access to source data is often restricted or infeasible in real-world scenarios. Under the source data restrictive circumstances, UDA is less practical. To address this, recent works have explored solutions under the Source-Free Domain Adaptation (SFDA) setup, which aims to adapt a source-trained model to the target domain without accessing source data. Still, existing SFDA approaches use only image-level information for adaptation, making them sub-optimal in video applications. This paper studies SFDA for Video Semantic Segmentation (VSS), where temporal information is leveraged to address video adaptation. Specifically, we propose Spatio-Temporal Pixel-Level (STPL) contrastive learning, a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

shaoyuanlo/stpl
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Advanced Neural Network Applications