Lightweight Backbone Networks Only Require Adaptive Lightweight Self-Attention Mechanisms
Fengyun Li, Chao Zheng, Yangyang Fang, Jialiang Lan, Jianhua Liang, Luhao Zhang, Fa Si

TL;DR
This paper introduces LOLViT, a lightweight hybrid backbone network that employs an adaptive lightweight self-attention mechanism called Fast Window Attention, achieving faster inference and higher accuracy in visual tasks.
Contribution
The paper proposes Fast Window Attention with adaptive feature map sizes and integrates it into LOLViT, a novel lightweight hybrid backbone network combining global-local feature fusion and GhostNet.
Findings
LOLViT outperforms CNN models of similar size in speed and accuracy.
LOLViT-X achieves 5x inference speed of MobileViT-X.
The adaptive attention mechanism effectively balances computational efficiency and modeling capacity.
Abstract
Currently, lightweight hybrid backbone networks have partially alleviated the issue of computational saturation, but the imbalance in computational efficiencys between convolutional neural networks (CNNs) and attention mechanisms is becoming increasingly apparent. Specifically, although linear attention mechanisms and their variants have made progress in lightweight design, they still fail to meet the demands of hybrid models for long-sequence modeling. On the other hand, existing lightweight SoftMax attention computations typically reduce the feature map to a fixed size to decrease the number of sequences, thereby compressing the computational scale. However, the process of determining the feature map reduction ratio is cumbersome, and computational saturation issues still persist. To address this issue, this paper proposes a lightweight SoftMax attention mechanism with adaptive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Big Data and Digital Economy · Generative Adversarial Networks and Image Synthesis
