Global Aggregation then Local Distribution for Scene Parsing

Xiangtai Li; Li Zhang; Guangliang Cheng; Kuiyuan Yang; Yunhai Tong,; Xiatian Zhu; Tao Xiang

arXiv:2107.13154·cs.CV·September 1, 2021

Global Aggregation then Local Distribution for Scene Parsing

Xiangtai Li, Li Zhang, Guangliang Cheng, Kuiyuan Yang, Yunhai Tong,, Xiatian Zhu, Tao Xiang

PDF

1 Repo

TL;DR

This paper introduces GALD, a novel module that combines global aggregation with local distribution to improve semantic segmentation accuracy, especially around boundaries and small objects.

Contribution

The paper proposes a new local distribution module that enhances global aggregation methods by adaptively modeling local relationships, leading to state-of-the-art results in semantic segmentation.

Findings

01

Achieved new state-of-the-art performance on Cityscapes, ADE20K, Pascal Context, Camvid, and COCO-stuff.

02

The approach is modular, end-to-end trainable, and easily integrated into existing networks.

03

Code and models are publicly available for further research.

Abstract

Modelling long-range contextual relationships is critical for pixel-wise prediction tasks such as semantic segmentation. However, convolutional neural networks (CNNs) are inherently limited to model such dependencies due to the naive structure in its building modules (\eg, local convolution kernel). While recent global aggregation methods are beneficial for long-range structure information modelling, they would oversmooth and bring noise to the regions containing fine details (\eg,~boundaries and small objects), which are very much cared for the semantic segmentation task. To alleviate this problem, we propose to explore the local context for making the aggregated long-range relationship being distributed more accurately in local regions. In particular, we design a novel local distribution module which models the affinity map between global and local relationship for each pixel…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lxtGH/GALD-DGCNet
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsConvolution