Towards Robust Video Object Segmentation with Adaptive Object Calibration
Xiaohao Xu, Jinglu Wang, Xiang Ming, Yan Lu

TL;DR
This paper introduces a novel adaptive object calibration network for semi-supervised video object segmentation, enhancing robustness and discriminability of object representations and masks through progressive calibration and prototype aggregation.
Contribution
It proposes an adaptive object proxy aggregation and progressive mask calibration method to improve robustness and discriminability in video object segmentation.
Findings
Achieves state-of-the-art results on YouTube-VOS and DAVIS benchmarks.
Demonstrates superior robustness against perturbations.
Outperforms existing methods in accuracy and stability.
Abstract
In the booming video era, video segmentation attracts increasing research attention in the multimedia community. Semi-supervised video object segmentation (VOS) aims at segmenting objects in all target frames of a video, given annotated object masks of reference frames. Most existing methods build pixel-wise reference-target correlations and then perform pixel-wise tracking to obtain target masks. Due to neglecting object-level cues, pixel-level approaches make the tracking vulnerable to perturbations, and even indiscriminate among similar objects. Towards robust VOS, the key insight is to calibrate the representation and mask of each specific object to be expressive and discriminative. Accordingly, we propose a new deep network, which can adaptively construct object representations and calibrate object masks to achieve stronger robustness. First, we construct the object representations…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Advanced Image and Video Retrieval Techniques · Advanced Neural Network Applications
MethodsVOS
