Training-free Token Reduction for Vision Mamba

Qiankun Ma; Ziyao Zhang; Chi Su; Jie Chen; Zhen Song; Hairong Zheng; Wen Gao

arXiv:2507.14042·cs.CV·July 21, 2025

Training-free Token Reduction for Vision Mamba

Qiankun Ma, Ziyao Zhang, Chi Su, Jie Chen, Zhen Song, Hairong Zheng, Wen Gao

PDF

Open Access

TL;DR

This paper introduces MTR, a training-free token reduction method for Vision Mamba that significantly reduces computational costs with minimal performance loss, without requiring retraining or additional tuning.

Contribution

We propose a novel, training-free token reduction framework specifically designed for Vision Mamba, addressing its unique sequence structure without relying on attention mechanisms.

Findings

01

Reduces FLOPs by approximately 40% on Vim-B backbone

02

Achieves only 1.6% performance drop on ImageNet without retraining

03

Effectively integrates across various Mamba models

Abstract

Vision Mamba has emerged as a strong competitor to Vision Transformers (ViTs) due to its ability to efficiently capture long-range dependencies with linear computational complexity. While token reduction, an effective compression technique in ViTs, has rarely been explored in Vision Mamba. Exploring Vision Mamba's efficiency is essential for enabling broader applications. However, we find that directly applying existing token reduction techniques for ViTs to Vision Mamba leads to significant performance degradation. This is primarily because Mamba is a sequence model without attention mechanisms, whereas most token reduction techniques for ViTs rely on attention mechanisms for importance measurement and overlook the order of compressed tokens. In this paper, we investigate a Mamba structure-aware importance score to evaluate token importance in a simple and effective manner. Building on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Generative Adversarial Networks and Image Synthesis · Adversarial Robustness in Machine Learning