Learning to Segment Instances in Videos with Spatial Propagation Network
Jingchun Cheng, Sifei Liu, Yi-Hsuan Tsai, Wei-Chih Hung, Shalini De, Mello, Jinwei Gu, Jan Kautz, Shengjin Wang, Ming-Hsuan Yang

TL;DR
This paper introduces a deep learning framework that combines a generic ResNet-101 model, instance-specific fine-tuning, and a spatial propagation network to achieve accurate instance-level segmentation in videos.
Contribution
It presents a novel combination of a generic segmentation model with spatial propagation for refined instance segmentation in videos.
Findings
Effective segmentation of multiple instances in videos
Improved spatial and temporal consistency in results
Robustness to various video scenarios
Abstract
We propose a deep learning-based framework for instance-level object segmentation. Our method mainly consists of three steps. First, We train a generic model based on ResNet-101 for foreground/background segmentations. Second, based on this generic model, we fine-tune it to learn instance-level models and segment individual objects by using augmented object annotations in first frames of test videos. To distinguish different instances in the same video, we compute a pixel-level score map for each object from these instance-level models. Each score map indicates the objectness likelihood and is only computed within the foreground mask obtained in the first step. To further refine this per frame score map, we learn a spatial propagation network. This network aims to learn how to propagate a coarse segmentation mask spatially based on the pairwise similarities in each frame. In addition,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Advanced Image and Video Retrieval Techniques · Video Surveillance and Tracking Methods
