FastFCN: Rethinking Dilated Convolution in the Backbone for Semantic Segmentation
Huikai Wu, Junge Zhang, Kaiqi Huang, Kongming Liang, Yizhou Yu

TL;DR
This paper introduces Joint Pyramid Upsampling (JPU), a novel module that replaces dilated convolutions in semantic segmentation models, significantly reducing computation and memory use while maintaining or improving accuracy.
Contribution
The paper proposes JPU, a new joint upsampling module that replaces dilated convolutions, reducing complexity and enhancing performance in semantic segmentation tasks.
Findings
JPU reduces computation by over three times without performance loss.
JPU outperforms other upsampling modules in experiments.
The method achieves state-of-the-art results on Pascal Context and ADE20K datasets.
Abstract
Modern approaches for semantic segmentation usually employ dilated convolutions in the backbone to extract high-resolution feature maps, which brings heavy computation complexity and memory footprint. To replace the time and memory consuming dilated convolutions, we propose a novel joint upsampling module named Joint Pyramid Upsampling (JPU) by formulating the task of extracting high-resolution feature maps into a joint upsampling problem. With the proposed JPU, our method reduces the computation complexity by more than three times without performance loss. Experiments show that JPU is superior to other upsampling modules, which can be plugged into many existing approaches to reduce computation complexity and improve performance. By replacing dilated convolutions with the proposed JPU module, our method achieves the state-of-the-art performance in Pascal Context dataset (mIoU of 53.13%)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning
