High-performance Semantic Segmentation Using Very Deep Fully   Convolutional Networks

Zifeng Wu; Chunhua Shen; Anton van den Hengel

arXiv:1604.04339·cs.CV·April 18, 2016·86 cites

High-performance Semantic Segmentation Using Very Deep Fully Convolutional Networks

Zifeng Wu, Chunhua Shen, Anton van den Hengel

PDF

Open Access

TL;DR

This paper introduces a high-performance semantic segmentation method using very deep residual networks, incorporating novel training techniques and network configurations to achieve state-of-the-art accuracy on benchmark datasets.

Contribution

The paper presents a new approach to semantic segmentation with very deep residual networks, including a simulation method for high-resolution features and an online bootstrapping training strategy.

Findings

01

Achieved 78.3% mean IoU on PASCAL VOC 2012

02

Outperformed previous methods on Cityscapes dataset

03

Demonstrated effectiveness of online bootstrapping and residual dropout

Abstract

We propose a method for high-performance semantic image segmentation (or semantic pixel labelling) based on very deep residual networks, which achieves the state-of-the-art performance. A few design factors are carefully considered to this end. We make the following contributions. (i) First, we evaluate different variations of a fully convolutional residual network so as to find the best configuration, including the number of layers, the resolution of feature maps, and the size of field-of-view. Our experiments show that further enlarging the field-of-view and increasing the resolution of feature maps are typically beneficial, which however inevitably leads to a higher demand for GPU memories. To walk around the limitation, we propose a new method to simulate a high resolution network with a low resolution network, which can be applied during training and/or testing. (ii) Second, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications

MethodsDropout