Recurrent Fully Convolutional Networks for Video Segmentation

Sepehr Valipour; Mennatullah Siam; Martin Jagersand; Nilanjan Ray

arXiv:1606.00487·cs.CV·November 1, 2016

Recurrent Fully Convolutional Networks for Video Segmentation

Sepehr Valipour, Mennatullah Siam, Martin Jagersand, Nilanjan Ray

PDF

TL;DR

This paper introduces a recurrent fully convolutional network with a novel convolutional gated recurrent unit for online video segmentation, demonstrating improved accuracy over non-recurrent methods.

Contribution

It presents a new recurrent architecture for video segmentation that preserves spatial information and operates online, unlike previous batch-based methods.

Findings

01

5.5% improvement in F-measure over plain FCN

02

1.4% F-measure improvement over baseline FCN 12s

03

Effective online segmentation with reduced parameters

Abstract

Image segmentation is an important step in most visual tasks. While convolutional neural networks have shown to perform well on single image segmentation, to our knowledge, no study has been been done on leveraging recurrent gated architectures for video segmentation. Accordingly, we propose a novel method for online segmentation of video sequences that incorporates temporal data. The network is built from fully convolutional element and recurrent unit that works on a sliding window over the temporal data. We also introduce a novel convolutional gated recurrent unit that preserves the spatial information and reduces the parameters learned. Our method has the advantage that it can work in an online fashion instead of operating over the whole input batch of video frames. The network is tested on the change detection dataset, and proved to have 5.5\% improvement in F-measure over a plain…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsMax Pooling · Convolution · Fully Convolutional Network