One-Shot Video Object Segmentation

Sergi Caelles; Kevis-Kokitsi Maninis; Jordi Pont-Tuset; Laura; Leal-Taix\'e; Daniel Cremers; Luc Van Gool

arXiv:1611.05198·cs.CV·April 14, 2017

One-Shot Video Object Segmentation

Sergi Caelles, Kevis-Kokitsi Maninis, Jordi Pont-Tuset, Laura, Leal-Taix\'e, Daniel Cremers, Luc Van Gool

PDF

5 Repos

TL;DR

This paper introduces OSVOS, a one-shot video object segmentation method using a fully-convolutional neural network that effectively segments objects in videos with high accuracy and temporal coherence, based on minimal initial annotation.

Contribution

The paper presents a novel one-shot segmentation approach that transfers semantic knowledge from ImageNet to individual object segmentation in videos, achieving state-of-the-art results.

Findings

01

OSVOS achieves 79.8% accuracy, surpassing previous methods.

02

The method is fast and produces temporally coherent segmentation.

03

It effectively transfers learned features from generic to specific object segmentation.

Abstract

This paper tackles the task of semi-supervised video object segmentation, i.e., the separation of an object from the background in a video, given the mask of the first frame. We present One-Shot Video Object Segmentation (OSVOS), based on a fully-convolutional neural network architecture that is able to successively transfer generic semantic information, learned on ImageNet, to the task of foreground segmentation, and finally to learning the appearance of a single annotated object of the test sequence (hence one-shot). Although all frames are processed independently, the results are temporally coherent and stable. We perform experiments on two annotated video segmentation databases, which show that OSVOS is fast and improves the state of the art by a significant margin (79.8% vs 68.0%).

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.