Deep Deconvolutional Networks for Scene Parsing
Rahul Mohan

TL;DR
This paper introduces a novel deep deconvolutional neural network architecture for scene parsing that learns higher order image structures and spatial priors, achieving state-of-the-art results without post-processing.
Contribution
It proposes a new network combining deconvolutional networks with CNNs and introduces multi-patch training for scene parsing.
Findings
Achieves state-of-the-art performance on four datasets.
Capable of learning higher order image structures.
Operates without post-processing steps.
Abstract
Scene parsing is an important and challenging prob- lem in computer vision. It requires labeling each pixel in an image with the category it belongs to. Tradition- ally, it has been approached with hand-engineered features from color information in images. Recently convolutional neural networks (CNNs), which automatically learn hierar- chies of features, have achieved record performance on the task. These approaches typically include a post-processing technique, such as superpixels, to produce the final label- ing. In this paper, we propose a novel network architecture that combines deep deconvolutional neural networks with CNNs. Our experiments show that deconvolutional neu- ral networks are capable of learning higher order image structure beyond edge primitives in comparison to CNNs. The new network architecture is employed for multi-patch training, introduced as part of this work.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications · Advanced Vision and Imaging
