Adaptive Fourier Neural Operators: Efficient Token Mixers for Transformers
John Guibas, Morteza Mardani, Zongyi Li, Andrew Tao, Anima Anandkumar,, Bryan Catanzaro

TL;DR
This paper introduces AFNO, an efficient Fourier-based token mixer for vision transformers, which reduces computational complexity and improves performance on high-resolution image segmentation tasks.
Contribution
The paper proposes AFNO, a novel Fourier neural operator-based token mixer with architectural modifications for efficient high-resolution vision transformer processing.
Findings
AFNO outperforms self-attention in few-shot segmentation accuracy.
AFNO handles sequence sizes of 65k efficiently in Cityscapes segmentation.
AFNO achieves quasi-linear complexity and linear memory in sequence size.
Abstract
Vision transformers have delivered tremendous success in representation learning. This is primarily due to effective token mixing through self attention. However, this scales quadratically with the number of pixels, which becomes infeasible for high-resolution inputs. To cope with this challenge, we propose Adaptive Fourier Neural Operator (AFNO) as an efficient token mixer that learns to mix in the Fourier domain. AFNO is based on a principled foundation of operator learning which allows us to frame token mixing as a continuous global convolution without any dependence on the input resolution. This principle was previously used to design FNO, which solves global convolution efficiently in the Fourier domain and has shown promise in learning challenging PDEs. To handle challenges in visual representation learning such as discontinuities in images and high resolution inputs, we propose…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCCD and CMOS Imaging Sensors · Advanced Neural Network Applications · Advanced Memory and Neural Computing
MethodsConvolution
