Frequency-Dynamic Attention Modulation for Dense Prediction
Linwei Chen, Lin Gu, Ying Fu

TL;DR
This paper introduces Frequency-Dynamic Attention Modulation (FDAM), a novel method inspired by circuit theory, to enhance Vision Transformers by dynamically adjusting their frequency response, thereby preserving details and textures across various vision tasks.
Contribution
FDAM is a new, plug-in strategy for ViTs that modulates frequency response using attention inversion and dynamic scaling, improving performance without representation collapse.
Findings
Improves semantic segmentation, object detection, and instance segmentation results.
Achieves state-of-the-art remote sensing detection performance.
Avoids representation collapse through feature similarity and rank analysis.
Abstract
Vision Transformers (ViTs) have significantly advanced computer vision, demonstrating strong performance across various tasks. However, the attention mechanism in ViTs makes each layer function as a low-pass filter, and the stacked-layer architecture in existing transformers suffers from frequency vanishing. This leads to the loss of critical details and textures. We propose a novel, circuit-theory-inspired strategy called Frequency-Dynamic Attention Modulation (FDAM), which can be easily plugged into ViTs. FDAM directly modulates the overall frequency response of ViTs and consists of two techniques: Attention Inversion (AttInv) and Frequency Dynamic Scaling (FreqScale). Since circuit theory uses low-pass filters as fundamental elements, we introduce AttInv, a method that generates complementary high-pass filtering by inverting the low-pass filter in the attention matrix, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Blind Source Separation Techniques
