Dense Prediction with Attentive Feature Aggregation
Yung-Hsu Yang, Thomas E. Huang, Min Sun, Samuel Rota Bul\`o, Peter, Kontschieder, Fisher Yu

TL;DR
This paper introduces Attentive Feature Aggregation (AFA), a novel method for fusing features across network layers using attention mechanisms, significantly improving dense prediction tasks like semantic segmentation and boundary detection.
Contribution
We propose AFA, an expressive non-linear feature fusion technique utilizing spatial and channel attention, applicable to various networks, with extensions inspired by neural volume rendering for multi-scale predictions.
Findings
AFA improves semantic segmentation performance by nearly 6% mIoU on Cityscapes.
AFA enhances boundary detection, achieving state-of-the-art results on BSDS500 and NYUDv2.
The method adds negligible computational overhead while providing significant accuracy gains.
Abstract
Aggregating information from features across different layers is an essential operation for dense prediction models. Despite its limited expressiveness, feature concatenation dominates the choice of aggregation operations. In this paper, we introduce Attentive Feature Aggregation (AFA) to fuse different network layers with more expressive non-linear operations. AFA exploits both spatial and channel attention to compute weighted average of the layer activations. Inspired by neural volume rendering, we extend AFA with Scale-Space Rendering (SSR) to perform late fusion of multi-scale predictions. AFA is applicable to a wide range of existing network designs. Our experiments show consistent and significant improvements on challenging semantic segmentation benchmarks, including Cityscapes, BDD100K, and Mapillary Vistas, at negligible computational and parameter overhead. In particular, AFA…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Dense Prediction with Attentive Feature Aggregation· youtube
Taxonomy
TopicsAdvanced Neural Network Applications · Generative Adversarial Networks and Image Synthesis · Human Pose and Action Recognition
