Adaptive Context Network for Scene Parsing
Jun Fu, Jing Liu, Yuhang Wang, Yong Li, Yongjun Bao, Jinhui Tang,, Hanqing Lu

TL;DR
This paper introduces ACNet, an adaptive scene parsing network that dynamically fuses global and local contexts per pixel, leading to state-of-the-art results across multiple datasets.
Contribution
It proposes a novel pixel-aware context fusion method using adaptive modules that measure and combine global and local context demands for improved scene parsing.
Findings
Achieves new state-of-the-art performance on Cityscapes, ADE20K, PASCAL Context, and COCO Stuff datasets.
Demonstrates the effectiveness of adaptive context fusion over fixed context methods.
Validates the approach through comprehensive experimental evaluations.
Abstract
Recent works attempt to improve scene parsing performance by exploring different levels of contexts, and typically train a well-designed convolutional network to exploit useful contexts across all pixels equally. However, in this paper, we find that the context demands are varying from different pixels or regions in each image. Based on this observation, we propose an Adaptive Context Network (ACNet) to capture the pixel-aware contexts by a competitive fusion of global context and local context according to different per-pixel demands. Specifically, when given a pixel, the global context demand is measured by the similarity between the global feature and its local feature, whose reverse value can be used to measure the local context demand. We model the two demand measurements by the proposed global context module and local context module, respectively, to generate adaptive contextual…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Advanced Neural Network Applications · Multimodal Machine Learning Applications
