AuxAdapt: Stable and Efficient Test-Time Adaptation for Temporally Consistent Video Semantic Segmentation
Yizhe Zhang, Shubhankar Borse, Hong Cai, Fatih Porikli

TL;DR
AuxAdapt is an efficient, unsupervised test-time adaptation method that improves temporal consistency in video segmentation without optical flow or extensive fine-tuning, using a small auxiliary network for stable updates.
Contribution
It introduces AuxAdapt, a novel online adaptation scheme that enhances temporal consistency in video segmentation by learning from model decisions with minimal computational overhead.
Findings
Achieves more temporally consistent segmentation results.
Reduces computational overhead by over 5 times compared to existing methods.
Demonstrates effectiveness on Cityscapes, CamVid, and KITTI benchmarks.
Abstract
In video segmentation, generating temporally consistent results across frames is as important as achieving frame-wise accuracy. Existing methods rely either on optical flow regularization or fine-tuning with test data to attain temporal consistency. However, optical flow is not always avail-able and reliable. Besides, it is expensive to compute. Fine-tuning the original model in test time is cost sensitive. This paper presents an efficient, intuitive, and unsupervised online adaptation method, AuxAdapt, for improving the temporal consistency of most neural network models. It does not require optical flow and only takes one pass of the video. Since inconsistency mainly arises from the model's uncertainty in its output, we propose an adaptation scheme where the model learns from its own segmentation decisions as it streams a video, which allows producing more confident and temporally…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Processing Techniques · Advanced Vision and Imaging · Advanced Neural Network Applications
MethodsTest
