Global Motion Understanding in Large-Scale Video Object Segmentation
Volodymyr Fedynyak, Yaroslav Romanus, Oles Dobosevych, Igor Babin,, Roman Riazantsev

TL;DR
This paper introduces WarpFormer, a semi-supervised video object segmentation method that leverages global motion knowledge from optical flow to improve accuracy and robustness in complex scenarios, trained on a large-scale dataset.
Contribution
WarpFormer integrates pre-trained optical flow estimation to enhance segmentation propagation, utilizing large-scale training data for improved performance in complex video scenarios.
Findings
Achieves 93.0% on DAVIS 2016 validation
Outperforms state-of-the-art on DAVIS 2017 test-dev
Demonstrates strong results on YouTube-VOS 2019 validation
Abstract
In this paper, we show that transferring knowledge from other domains of video understanding combined with large-scale learning can improve robustness of Video Object Segmentation (VOS) under complex circumstances. Namely, we focus on integrating scene global motion knowledge to improve large-scale semi-supervised Video Object Segmentation. Prior works on VOS mostly rely on direct comparison of semantic and contextual features to perform dense matching between current and past frames, passing over actual motion structure. On the other hand, Optical Flow Estimation task aims to approximate the scene motion field, exposing global motion patterns which are typically undiscoverable during all pairs similarity search. We present WarpFormer, an architecture for semi-supervised Video Object Segmentation that exploits existing knowledge in motion understanding to conduct smoother propagation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Advanced Vision and Imaging · Video Surveillance and Tracking Methods
MethodsFocus · VOS
