TL;DR
This paper introduces GALD, a novel module that combines global aggregation with local distribution to improve semantic segmentation accuracy, especially around boundaries and small objects.
Contribution
The paper proposes a new local distribution module that enhances global aggregation methods by adaptively modeling local relationships, leading to state-of-the-art results in semantic segmentation.
Findings
Achieved new state-of-the-art performance on Cityscapes, ADE20K, Pascal Context, Camvid, and COCO-stuff.
The approach is modular, end-to-end trainable, and easily integrated into existing networks.
Code and models are publicly available for further research.
Abstract
Modelling long-range contextual relationships is critical for pixel-wise prediction tasks such as semantic segmentation. However, convolutional neural networks (CNNs) are inherently limited to model such dependencies due to the naive structure in its building modules (\eg, local convolution kernel). While recent global aggregation methods are beneficial for long-range structure information modelling, they would oversmooth and bring noise to the regions containing fine details (\eg,~boundaries and small objects), which are very much cared for the semantic segmentation task. To alleviate this problem, we propose to explore the local context for making the aggregated long-range relationship being distributed more accurately in local regions. In particular, we design a novel local distribution module which models the affinity map between global and local relationship for each pixel…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsConvolution
