FsaNet: Frequency Self-attention for Semantic Segmentation
Fengyu Zhang, Ashkan Panahi, Guangjun Gao

TL;DR
FsaNet introduces a low-complexity frequency self-attention mechanism that efficiently enhances semantic segmentation performance across multiple models and datasets, with significant reductions in computational resources.
Contribution
The paper proposes a novel frequency-based self-attention method that reduces complexity and memory usage while maintaining or improving segmentation accuracy.
Findings
Achieves up to 90% reduction in memory and FLOPs compared to regular self-attention.
Attains a new state-of-the-art 83.0% mIoU on Cityscapes with ResNet101.
Enhances various models like Mask R-CNN and Segformer without retraining.
Abstract
Considering the spectral properties of images, we propose a new self-attention mechanism with highly reduced computational complexity, up to a linear rate. To better preserve edges while promoting similarity within objects, we propose individualized processes over different frequency bands. In particular, we study a case where the process is merely over low-frequency components. By ablation study, we show that low frequency self-attention can achieve very close or better performance relative to full frequency even without retraining the network. Accordingly, we design and embed novel plug-and-play modules to the head of a CNN network that we refer to as FsaNet. The frequency self-attention 1) requires only a few low frequency coefficients as input, 2) can be mathematically equivalent to spatial domain self-attention with linear structures, 3) simplifies token mapping (…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Brain Tumor Detection and Classification · Domain Adaptation and Few-Shot Learning
MethodsDense Connections · Refunds@Expedia|||How do I get a full refund from Expedia? · Residual Connection · Mix-FFN · Linear Layer · Softmax · SegFormer · Convolution · RoIAlign · Region Proposal Network
