Collaborative Video Object Segmentation by Foreground-Background   Integration

Zongxin Yang; Yunchao Wei; Yi Yang

arXiv:2003.08333·cs.CV·July 24, 2020·21 cites

Collaborative Video Object Segmentation by Foreground-Background Integration

Zongxin Yang, Yunchao Wei, Yi Yang

PDF

Open Access 2 Repos

TL;DR

This paper introduces CFBI, a novel semi-supervised video object segmentation method that treats foreground and background features equally, enhancing segmentation robustness and outperforming existing state-of-the-art approaches on multiple benchmarks.

Contribution

The paper proposes a contrastive embedding learning approach that integrates foreground and background features for improved video object segmentation.

Findings

01

Achieves state-of-the-art performance on DAVIS 2016, DAVIS 2017, and YouTube-VOS datasets.

02

Effectively promotes contrastive feature embedding between foreground and background.

03

Demonstrates robustness to various object scales through combined pixel and instance-level matching.

Abstract

This paper investigates the principles of embedding learning to tackle the challenging semi-supervised video object segmentation. Different from previous practices that only explore the embedding learning using pixels from foreground object (s), we consider background should be equally treated and thus propose Collaborative video object segmentation by Foreground-Background Integration (CFBI) approach. Our CFBI implicitly imposes the feature embedding from the target foreground object and its corresponding background to be contrastive, promoting the segmentation results accordingly. With the feature embedding from both foreground and background, our CFBI performs the matching process between the reference and the predicted sequence from both pixel and instance levels, making the CFBI be robust to various object scales. We conduct extensive experiments on three popular benchmarks, i.e.,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVisual Attention and Saliency Detection · Video Surveillance and Tracking Methods · Advanced Image and Video Retrieval Techniques