Fully Convolutional Networks for Semantic Segmentation
Jonathan Long, Evan Shelhamer, Trevor Darrell

TL;DR
This paper introduces fully convolutional networks that perform end-to-end pixel-wise semantic segmentation, achieving state-of-the-art results efficiently by adapting classification models and combining multi-layer features.
Contribution
It presents the concept of fully convolutional networks for dense prediction, adapting existing classification architectures, and introduces a novel multi-layer feature integration for improved segmentation accuracy.
Findings
Achieved 62.2% mean IU on PASCAL VOC 2012
Reduced inference time to one third of a second per image
Outperformed previous state-of-the-art methods in semantic segmentation
Abstract
Convolutional networks are powerful visual models that yield hierarchies of features. We show that convolutional networks by themselves, trained end-to-end, pixels-to-pixels, exceed the state-of-the-art in semantic segmentation. Our key insight is to build "fully convolutional" networks that take input of arbitrary size and produce correspondingly-sized output with efficient inference and learning. We define and detail the space of fully convolutional networks, explain their application to spatially dense prediction tasks, and draw connections to prior models. We adapt contemporary classification networks (AlexNet, the VGG net, and GoogLeNet) into fully convolutional networks and transfer their learned representations by fine-tuning to the segmentation task. We then define a novel architecture that combines semantic information from a deep, coarse layer with appearance information from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications
MethodsAverage Pooling · Dropout · Global Average Pooling · Dense Connections · *Communicated@Fast*How Do I Communicate to Expedia? · Max Pooling · Softmax · Convolution · Ethereum Customer Service Number +1-833-534-1729 · Bitcoin Customer Service Number +1-833-534-1729
