Reliable Propagation-Correction Modulation for Video Object Segmentation
Xiaohao Xu, Jinglu Wang, Xiao Li, Yan Lu

TL;DR
This paper introduces a novel correction mechanism with dedicated modulators to reduce error propagation in online semi-supervised video object segmentation, achieving state-of-the-art results by effectively utilizing reliable cues and feature augmentation.
Contribution
It proposes a disentangled correction approach with propagation and correction modulators, along with feature augmentation and a reliability filter, to enhance segmentation accuracy and robustness.
Findings
Achieves state-of-the-art performance on YouTube-VOS18/19 and DAVIS17 benchmarks.
Demonstrates significant performance gains through the correction mechanism.
Validates the effectiveness of reliable cue augmentation and filtering.
Abstract
Error propagation is a general but crucial problem in online semi-supervised video object segmentation. We aim to suppress error propagation through a correction mechanism with high reliability. The key insight is to disentangle the correction from the conventional mask propagation process with reliable cues. We introduce two modulators, propagation and correction modulators, to separately perform channel-wise re-calibration on the target frame embeddings according to local temporal correlations and reliable references respectively. Specifically, we assemble the modulators with a cascaded propagation-correction scheme. This avoids overriding the effects of the reliable correction modulator by the propagation modulator. Although the reference frame with the ground truth label provides reliable cues, it could be very different from the target frame and introduce uncertain or incomplete…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Visual Attention and Saliency Detection · Advanced Neural Network Applications
