Video Object Segmentation with Re-identification

Xiaoxiao Li; Yuankai Qi; Zhe Wang; Kai Chen; Ziwei Liu; Jianping Shi,; Ping Luo; Xiaoou Tang; Chen Change Loy

arXiv:1708.00197·cs.CV·August 2, 2017·67 cites

Video Object Segmentation with Re-identification

Xiaoxiao Li, Yuankai Qi, Zhe Wang, Kai Chen, Ziwei Liu, Jianping Shi,, Ping Luo, Xiaoou Tang, Chen Change Loy

PDF

Open Access 3 Repos

TL;DR

This paper introduces VS-ReID, a video object segmentation model that combines mask propagation and adaptive re-identification to improve accuracy and robustness against large displacements and drifting.

Contribution

The paper proposes a novel re-identification mechanism integrated with mask propagation for more reliable video object segmentation.

Findings

01

Achieves a global mean of 0.699 on DAVIS 2017, outperforming previous methods.

02

Effectively handles large displacements and target re-identification.

03

Sets new state-of-the-art performance in video segmentation challenge.

Abstract

Conventional video segmentation methods often rely on temporal continuity to propagate masks. Such an assumption suffers from issues like drifting and inability to handle large displacement. To overcome these issues, we formulate an effective mechanism to prevent the target from being lost via adaptive object re-identification. Specifically, our Video Object Segmentation with Re-identification (VS-ReID) model includes a mask propagation module and a ReID module. The former module produces an initial probability map by flow warping while the latter module retrieves missing instances by adaptive matching. With these two modules iteratively applied, our VS-ReID records a global mean (Region Jaccard and Boundary F measure) of 0.699, the best performance in 2017 DAVIS Challenge.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Visual Attention and Saliency Detection