Learning Deconvolution Network for Semantic Segmentation
Hyeonwoo Noh, Seunghoon Hong, Bohyung Han

TL;DR
This paper introduces a deconvolution network-based approach for semantic segmentation, leveraging VGG-derived features, to improve detail and multi-scale object recognition, achieving top accuracy on PASCAL VOC 2012 without external data.
Contribution
It presents a novel deep deconvolution network architecture for semantic segmentation that enhances detail and multi-scale object detection over existing fully convolutional methods.
Findings
Achieves 72.5% accuracy on PASCAL VOC 2012 without external data.
Effectively captures detailed structures and handles multiple object scales.
Outperforms previous methods in accuracy through ensemble techniques.
Abstract
We propose a novel semantic segmentation algorithm by learning a deconvolution network. We learn the network on top of the convolutional layers adopted from VGG 16-layer net. The deconvolution network is composed of deconvolution and unpooling layers, which identify pixel-wise class labels and predict segmentation masks. We apply the trained network to each proposal in an input image, and construct the final semantic segmentation map by combining the results from all proposals in a simple manner. The proposed algorithm mitigates the limitations of the existing methods based on fully convolutional networks by integrating deep deconvolution network and proposal-wise prediction; our segmentation method typically identifies detailed structures and handles objects in multiple scales naturally. Our network demonstrates outstanding performance in PASCAL VOC 2012 dataset, and we achieve the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Medical Image Segmentation Techniques · Multimodal Machine Learning Applications
MethodsDropout · Dense Connections · *Communicated@Fast*How Do I Communicate to Expedia? · Max Pooling · Softmax · Convolution · Ethereum Customer Service Number +1-833-534-1729
