Training-free Token Reduction for Vision Mamba
Qiankun Ma, Ziyao Zhang, Chi Su, Jie Chen, Zhen Song, Hairong Zheng, Wen Gao

TL;DR
This paper introduces MTR, a training-free token reduction method for Vision Mamba that significantly reduces computational costs with minimal performance loss, without requiring retraining or additional tuning.
Contribution
We propose a novel, training-free token reduction framework specifically designed for Vision Mamba, addressing its unique sequence structure without relying on attention mechanisms.
Findings
Reduces FLOPs by approximately 40% on Vim-B backbone
Achieves only 1.6% performance drop on ImageNet without retraining
Effectively integrates across various Mamba models
Abstract
Vision Mamba has emerged as a strong competitor to Vision Transformers (ViTs) due to its ability to efficiently capture long-range dependencies with linear computational complexity. While token reduction, an effective compression technique in ViTs, has rarely been explored in Vision Mamba. Exploring Vision Mamba's efficiency is essential for enabling broader applications. However, we find that directly applying existing token reduction techniques for ViTs to Vision Mamba leads to significant performance degradation. This is primarily because Mamba is a sequence model without attention mechanisms, whereas most token reduction techniques for ViTs rely on attention mechanisms for importance measurement and overlook the order of compressed tokens. In this paper, we investigate a Mamba structure-aware importance score to evaluate token importance in a simple and effective manner. Building on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Generative Adversarial Networks and Image Synthesis · Adversarial Robustness in Machine Learning
