Denoised Non-Local Neural Network for Semantic Segmentation
Qi Song, Jie Li, Hao Guo, Rui Huang

TL;DR
This paper introduces a denoising approach for non-local neural networks in semantic segmentation, improving attention map reliability by reducing noise and enhancing accuracy without external data.
Contribution
The paper proposes a novel Denoised Non-Local Network with Global Rectifying and Local Retention modules to effectively eliminate attention noise in semantic segmentation.
Findings
Achieves state-of-the-art 83.5% mIoU on Cityscapes.
Attains 46.69% mIoU on ADE20K.
Improves segmentation accuracy by denoising attention maps.
Abstract
The non-local network has become a widely used technique for semantic segmentation, which computes an attention map to measure the relationships of each pixel pair. However, most of the current popular non-local models tend to ignore the phenomenon that the calculated attention map appears to be very noisy, containing inter-class and intra-class inconsistencies, which lowers the accuracy and reliability of the non-local methods. In this paper, we figuratively denote these inconsistencies as attention noises and explore the solutions to denoise them. Specifically, we inventively propose a Denoised Non-Local Network (Denoised NL), which consists of two primary modules, i.e., the Global Rectifying (GR) block and the Local Retention (LR) block, to eliminate the inter-class and intra-class noises respectively. First, GR adopts the class-level predictions to capture a binary map to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Visual Attention and Saliency Detection
