Rotation Equivariant Mamba for Vision Tasks

Zhongchen Zhao; Qi Xie; Keyu Huang; Lei Zhang; Deyu Meng; and Zongben Xu

arXiv:2603.09138·cs.CV·April 7, 2026

Rotation Equivariant Mamba for Vision Tasks

Zhongchen Zhao, Qi Xie, Keyu Huang, Lei Zhang, Deyu Meng, and Zongben Xu

PDF

1 Repo

TL;DR

This paper introduces EQ-VMamba, a novel rotation equivariant architecture for vision tasks that improves robustness and efficiency by embedding rotation symmetry into Mamba-based models.

Contribution

It presents the first rotation equivariant visual Mamba architecture, combining a new strategy and theoretical analysis to enforce end-to-end rotation equivariance.

Findings

01

EQ-VMamba improves rotation robustness across benchmarks.

02

It achieves superior or competitive performance with fewer parameters.

03

The architecture enhances model robustness and efficiency.

Abstract

Rotation equivariance constitutes one of the most general and crucial structural priors for visual data, yet it remains notably absent from current Mamba-based vision architectures. Despite the success of Mamba in natural language processing and its growing adoption in computer vision, existing visual Mamba models fail to account for rotational symmetry in their design. This omission renders them inherently sensitive to image rotations, thereby constraining their robustness and cross-task generalization. To address this limitation, we incorporate rotation symmetry, a universal and fundamental geometric prior in images, into Mamba-based architectures. Specifically, we introduce EQ-VMamba, the first rotation equivariant visual Mamba architecture for vision tasks. The core components of EQ-VMamba include a carefully designed rotation equivariant cross-scan strategy and group Mamba blocks.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zhongchenzhao/EQ-VMamba
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.