Beyond Single Stage Encoder-Decoder Networks: Deep Decoders for Semantic Image Segmentation
Gabriel L. Oliveira, Senthil Yogamani, Wolfram Burgard, Thomas Brox

TL;DR
This paper introduces a novel deep decoder architecture with residual skip connections and a class re-balancing weight function, achieving state-of-the-art results in semantic segmentation and enhancing existing methods with motion information.
Contribution
It proposes a new deep decoder architecture with residual skip connections and a class re-balancing weight function, improving segmentation accuracy.
Findings
State-of-the-art results on CamVid, Gatech, and Freiburg Forest datasets.
Enhanced segmentation performance when combining optical flow with image data.
Effective improvement of existing segmentation methods using the proposed decoder.
Abstract
Single encoder-decoder methodologies for semantic segmentation are reaching their peak in terms of segmentation quality and efficiency per number of layers. To address these limitations, we propose a new architecture based on a decoder which uses a set of shallow networks for capturing more information content. The new decoder has a new topology of skip connections, namely backward and stacked residual connections. In order to further improve the architecture we introduce a weight function which aims to re-balance classes to increase the attention of the networks to under-represented objects. We carried out an extensive set of experiments that yielded state-of-the-art results for the CamVid, Gatech and Freiburg Forest datasets. Moreover, to further prove the effectiveness of our decoder, we conducted a set of experiments studying the impact of our decoder to state-of-the-art…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications
