2nd Place Solution for MOSE Track in CVPR 2024 PVUW workshop: Complex   Video Object Segmentation

Zhensong Xu; Jiangtao Yao; Chengjing Wu; Ting Liu; Luoqi Liu

arXiv:2406.08192·cs.CV·June 13, 2024

2nd Place Solution for MOSE Track in CVPR 2024 PVUW workshop: Complex Video Object Segmentation

Zhensong Xu, Jiangtao Yao, Chengjing Wu, Ting Liu, Luoqi Liu

PDF

Open Access

TL;DR

This paper presents a robust video object segmentation method that uses data augmentation, instance segmentation, and test-time strategies, achieving second place in the MOSE track of PVUW 2024 with high accuracy.

Contribution

The approach introduces novel data augmentation and inference techniques to improve segmentation of tiny, similar, and fast-moving objects in videos.

Findings

01

Achieved 2nd place in PVUW 2024 MOSE track.

02

Enhanced segmentation accuracy with data augmentation and TTA.

03

Improved robustness against motion blur.

Abstract

Complex video object segmentation serves as a fundamental task for a wide range of downstream applications such as video editing and automatic data annotation. Here we present the 2nd place solution in the MOSE track of PVUW 2024. To mitigate problems caused by tiny objects, similar objects and fast movements in MOSE. We use instance segmentation to generate extra pretraining data from the valid and test set of MOSE. The segmented instances are combined with objects extracted from COCO to augment the training data and enhance semantic representation of the baseline model. Besides, motion blur is added during training to increase robustness against image blur induced by motion. Finally, we apply test time augmentation (TTA) and memory strategy to the inference stage. Our method ranked 2nd in the MOSE track of PVUW 2024, with a $J$ of 0.8007, a $F$ of 0.8683 and a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage Processing Techniques and Applications

MethodsSparse Evolutionary Training