TL;DR
This paper introduces AGLN, a novel network that enhances global context and refines local features for improved semantic segmentation, achieving state-of-the-art results on multiple datasets.
Contribution
The paper proposes a new architecture combining global enhancement and local refinement modules within a context fusion block for better segmentation accuracy.
Findings
Achieves 56.23% mIoU on PASCAL Context with ResNet-101 backbone.
Outperforms existing lightweight segmentation models on ADE20K and PASCAL VOC 2012.
Effectively integrates global and local features for superior segmentation performance.
Abstract
The encoder-decoder architecture is widely used as a lightweight semantic segmentation network. However, it struggles with a limited performance compared to a well-designed Dilated-FCN model for two major problems. First, commonly used upsampling methods in the decoder such as interpolation and deconvolution suffer from a local receptive field, unable to encode global contexts. Second, low-level features may bring noises to the network decoder through skip connections for the inadequacy of semantic concepts in early encoder layers. To tackle these challenges, a Global Enhancement Method is proposed to aggregate global information from high-level feature maps and adaptively distribute them to different decoder layers, alleviating the shortage of global contexts in the upsampling process. Besides, a Local Refinement Module is developed by utilizing the decoder features as the semantic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
