Target-Aware Object Discovery and Association for Unsupervised Video Multi-Object Segmentation
Tianfei Zhou, Jianwu Li, Xueyi Li, Ling Shao

TL;DR
This paper introduces a novel unsupervised video multi-object segmentation method that combines instance discrimination with temporal guidance and a discriminative appearance model, improving accuracy and speed.
Contribution
It proposes an integrated approach for object discovery and association that enhances generalization and efficiency in unsupervised video segmentation.
Findings
Outperforms state-of-the-art methods on DAVIS17 and YouTube-VIS datasets.
Achieves higher segmentation accuracy and faster inference.
Effectively captures target-specific features for better temporal association.
Abstract
This paper addresses the task of unsupervised video multi-object segmentation. Current approaches follow a two-stage paradigm: 1) detect object proposals using pre-trained Mask R-CNN, and 2) conduct generic feature matching for temporal association using re-identification techniques. However, the generic features, widely used in both stages, are not reliable for characterizing unseen objects, leading to poor generalization. To address this, we introduce a novel approach for more accurate and efficient spatio-temporal segmentation. In particular, to address \textbf{instance discrimination}, we propose to combine foreground region estimation and instance grouping together in one network, and additionally introduce temporal guidance for segmenting each frame, enabling more accurate object discovery. For \textbf{temporal association}, we complement current video object segmentation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Advanced Neural Network Applications · Video Surveillance and Tracking Methods
MethodsRegion Proposal Network · Softmax · RoIAlign · Convolution · Mask R-CNN
