Learning Video Object Segmentation from Static Images

Anna Khoreva; Federico Perazzi; Rodrigo Benenson; Bernt Schiele,; Alexander Sorkine-Hornung

arXiv:1612.02646·cs.CV·February 5, 2019·70 cites

Learning Video Object Segmentation from Static Images

Anna Khoreva, Federico Perazzi, Rodrigo Benenson, Bernt Schiele,, Alexander Sorkine-Hornung

PDF

Open Access 2 Repos

TL;DR

This paper presents a novel video object segmentation method that leverages static image training and combines offline and online learning to achieve accurate, flexible segmentation across various datasets and annotation types.

Contribution

It introduces a guided instance segmentation approach that operates per frame, using static image-trained convnets and combined learning strategies for improved video segmentation.

Findings

01

Achieves competitive results on three datasets

02

Handles multiple annotation types effectively

03

Operates with static image training only

Abstract

Inspired by recent advances of deep learning in instance segmentation and object tracking, we introduce video object segmentation problem as a concept of guided instance segmentation. Our model proceeds on a per-frame basis, guided by the output of the previous frame towards the object of interest in the next frame. We demonstrate that highly accurate object segmentation in videos can be enabled by using a convnet trained with static images only. The key ingredient of our approach is a combination of offline and online learning strategies, where the former serves to produce a refined mask from the previous frame estimate and the latter allows to capture the appearance of the specific object instance. Our method can handle different types of input annotations: bounding boxes and segments, as well as incorporate multiple annotated frames, making the system suitable for diverse…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVideo Surveillance and Tracking Methods · Advanced Image and Video Retrieval Techniques · Visual Attention and Saliency Detection