We're Not Using Videos Effectively: An Updated Domain Adaptive Video   Segmentation Baseline

Simar Kareer; Vivek Vijaykumar; Harsh Maheshwari; Prithvijit; Chattopadhyay; Judy Hoffman; Viraj Prabhu

arXiv:2402.00868·cs.CV·February 29, 2024·2 cites

We're Not Using Videos Effectively: An Updated Domain Adaptive Video Segmentation Baseline

Simar Kareer, Vivek Vijaykumar, Harsh Maheshwari, Prithvijit, Chattopadhyay, Judy Hoffman, Viraj Prabhu

PDF

Open Access 1 Repo

TL;DR

This paper reveals that state-of-the-art image domain adaptation methods outperform video-specific methods on benchmarks, highlighting the need for better Video-DAS techniques and providing a unified codebase for fair comparison.

Contribution

It provides a comprehensive benchmark comparing Image-DAS and Video-DAS, showing current Video-DAS methods are outperformed by Image-DAS, and releases a unified codebase for future research.

Findings

01

Image-DAS methods outperform Video-DAS on benchmarks

02

Naive combinations of DAS techniques yield marginal improvements

03

Open-source code supports comprehensive evaluation

Abstract

There has been abundant work in unsupervised domain adaptation for semantic segmentation (DAS) seeking to adapt a model trained on images from a labeled source domain to an unlabeled target domain. While the vast majority of prior work has studied this as a frame-level Image-DAS problem, a few Video-DAS works have sought to additionally leverage the temporal signal present in adjacent frames. However, Video-DAS works have historically studied a distinct set of benchmarks from Image-DAS, with minimal cross-benchmarking. In this work, we address this gap. Surprisingly, we find that (1) even after carefully controlling for data and model architecture, state-of-the-art Image-DAS methods (HRDA and HRDA+MIC) outperform Video-DAS methods on established Video-DAS benchmarks (+14.5 mIoU on Viper $\to$ CityscapesSeq, +19.0 mIoU on Synthia $\to$ CityscapesSeq), and (2) naive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

simarkareer/unifiedvideoda
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVideo Analysis and Summarization · Image and Video Quality Assessment

MethodsSparse Evolutionary Training