TL;DR
This paper introduces a novel semantic segmentation architecture that replaces traditional decoders with a depth-to-space operation, significantly reducing computational cost while maintaining competitive performance.
Contribution
The authors propose a decoder-free segmentation model using depth-to-space, achieving efficiency gains over traditional encoder-decoder architectures.
Findings
Reduces computation by nearly 50%
Achieves comparable accuracy to standard models
Performs well on DeepGlobe Road Extraction dataset
Abstract
In this paper, we propose an efficient architecture for semantic image segmentation using the depth-to-space (D2S) operation. Our D2S model is comprised of a standard CNN encoder followed by a depth-to-space reordering of the final convolutional feature maps. Our approach eliminates the decoder portion of traditional encoder-decoder segmentation models and reduces the amount of computation almost by half. As a participant of the DeepGlobe Road Extraction competition, we evaluate our models on the corresponding road segmentation dataset. Our highly efficient D2S models exhibit comparable performance to standard segmentation models with much lower computational cost.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
