Large Kernel Matters -- Improve Semantic Segmentation by Global Convolutional Network
Chao Peng, Xiangyu Zhang, Gang Yu, Guiming Luo, Jian Sun

TL;DR
This paper introduces a Global Convolutional Network that leverages large kernels to improve semantic segmentation by enhancing classification and localization, achieving state-of-the-art results on benchmark datasets.
Contribution
The paper proposes a novel Global Convolutional Network architecture with large kernels and residual boundary refinement for better semantic segmentation.
Findings
Achieves 82.2% mIoU on PASCAL VOC 2012, surpassing previous methods.
Achieves 76.9% mIoU on Cityscapes, outperforming prior results.
Demonstrates the importance of large kernels in dense pixel prediction tasks.
Abstract
One of recent trends [30, 31, 14] in network architec- ture design is stacking small filters (e.g., 1x1 or 3x3) in the entire network because the stacked small filters is more ef- ficient than a large kernel, given the same computational complexity. However, in the field of semantic segmenta- tion, where we need to perform dense per-pixel prediction, we find that the large kernel (and effective receptive field) plays an important role when we have to perform the clas- sification and localization tasks simultaneously. Following our design principle, we propose a Global Convolutional Network to address both the classification and localization issues for the semantic segmentation. We also suggest a residual-based boundary refinement to further refine the ob- ject boundaries. Our approach achieves state-of-art perfor- mance on two public benchmarks and significantly outper- forms previous…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Remote-Sensing Image Classification · Automated Road and Building Extraction
MethodsAverage Pooling · Global Convolutional Network · Residual Connection · *Communicated@Fast*How Do I Communicate to Expedia? · 1x1 Convolution · Batch Normalization · Bottleneck Residual Block · Global Average Pooling · Residual Block · Kaiming Initialization
