Fully Attentional Network for Semantic Segmentation
Qi Song, Jie Li, Chenghong Li, Hao Guo, Rui Huang

TL;DR
This paper introduces FLANet, a fully attentional network that captures both spatial and channel dependencies simultaneously, improving semantic segmentation accuracy while maintaining efficiency.
Contribution
The paper proposes a novel fully attentional module that encodes spatial and channel attentions in a single similarity map, addressing attention missing issues in prior methods.
Findings
Achieved state-of-the-art results on Cityscapes, ADE20K, and PASCAL VOC datasets.
Outperformed existing methods in capturing feature dependencies for segmentation.
Maintained high computational efficiency with the new attention mechanism.
Abstract
Recent non-local self-attention methods have proven to be effective in capturing long-range dependencies for semantic segmentation. These methods usually form a similarity map of RC*C (by compressing spatial dimensions) or RHW*HW (by compressing channels) to describe the feature relations along either channel or spatial dimensions, where C is the number of channels, H and W are the spatial dimensions of the input feature map. However, such practices tend to condense feature dependencies along the other dimensions,hence causing attention missing, which might lead to inferior results for small/thin categories or inconsistent segmentation inside large objects. To address this problem, we propose anew approach, namely Fully Attentional Network (FLANet),to encode both spatial and channel attentions in a single similarity map while maintaining high computational efficiency. Specifically, for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications
