Fully Convolutional Networks for Semantic Segmentation
Evan Shelhamer, Jonathan Long, Trevor Darrell

TL;DR
This paper introduces fully convolutional networks for semantic segmentation, enabling end-to-end pixel-wise predictions that significantly improve accuracy and inference speed on multiple datasets.
Contribution
It adapts classification networks into fully convolutional architectures and proposes a skip architecture for detailed segmentation, advancing the state-of-the-art.
Findings
30% relative improvement on PASCAL VOC 2012
Achieves 67.2% mean IU on PASCAL VOC 2012
Inference time of one-tenth of a second per image
Abstract
Convolutional networks are powerful visual models that yield hierarchies of features. We show that convolutional networks by themselves, trained end-to-end, pixels-to-pixels, improve on the previous best result in semantic segmentation. Our key insight is to build "fully convolutional" networks that take input of arbitrary size and produce correspondingly-sized output with efficient inference and learning. We define and detail the space of fully convolutional networks, explain their application to spatially dense prediction tasks, and draw connections to prior models. We adapt contemporary classification networks (AlexNet, the VGG net, and GoogLeNet) into fully convolutional networks and transfer their learned representations by fine-tuning to the segmentation task. We then define a skip architecture that combines semantic information from a deep, coarse layer with appearance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications
MethodsTransposed convolution · Dropout · Dense Connections · *Communicated@Fast*How Do I Communicate to Expedia? · Max Pooling · Softmax · Convolution · Ethereum Customer Service Number +1-833-534-1729 · Fully Convolutional Network
