Fast-ParC: Capturing Position Aware Global Feature for ConvNets and ViTs
Tao Yang, Haokui Zhang, Wenze Hu, Changwen Chen, Xiaoyu Wang

TL;DR
This paper introduces Fast-ParC, a novel position-aware global convolution operator that combines the strengths of ViTs and ConvNets, enabling efficient global feature extraction with reduced computational complexity.
Contribution
The paper proposes the Fast-ParC operator, which captures global features efficiently and can be integrated into existing models to enhance their performance and speed.
Findings
Fast-ParC reduces complexity from O(n^2) to O(n log n) using FFT.
Using ParC enlarges receptive fields and improves accuracy in vision tasks.
Fast-ParC benefits both ViTs and ConvNets across multiple vision benchmarks.
Abstract
Transformer models have made tremendous progress in various fields in recent years. In the field of computer vision, vision transformers (ViTs) also become strong alternatives to convolutional neural networks (ConvNets), yet they have not been able to replace ConvNets since both have their own merits. For instance, ViTs are good at extracting global features with attention mechanisms while ConvNets are more efficient in modeling local relationships due to their strong inductive bias. A natural idea that arises is to combine the strengths of both ConvNets and ViTs to design new structures. In this paper, we propose a new basic neural network operator named position-aware circular convolution (ParC) and its accelerated version Fast-ParC. The ParC operator can capture global features by using a global kernel and circular convolution while keeping location sensitiveness by employing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCCD and CMOS Imaging Sensors · Advanced Neural Network Applications · Advanced Image and Video Retrieval Techniques
MethodsConvolution
