LBMamba: Locally Bi-directional Mamba
Jingwei Zhang, Xi Han, Hong Qin, Mahdi S. Hosseini, Dimitris Samaras

TL;DR
LBMamba introduces a locally bi-directional state space model block that improves the efficiency and accuracy of Mamba-based vision models by embedding a lightweight local backward scan inside the forward scan, which avoids the cost of a separate global backward scan.
Contribution
It proposes LBMamba, a novel locally bi-directional SSM block, and LBVim, a backbone that recovers global receptive fields without additional backward scans, improving performance and efficiency.
Findings
Achieves higher accuracy on ImageNet, ADE20K, and COCO datasets.
Boosts the performance of multiple SOTA Mamba models.
Significantly improves whole-slide image (WSI) classification metrics.
Abstract
Mamba, a State Space Model (SSM) that accelerates training by recasting recurrence as a parallel scan, has recently emerged as a linearly-scaling alternative to self-attention. Because of its unidirectional nature, each state in Mamba only has information about the states before it and is blind to the states after it. Current Mamba-based computer-vision methods typically overcome this by augmenting Mamba's global forward scan with a global backward scan, forming a bi-directional scan that restores a full receptive field. However, this operation doubles the computational load, eroding much of the efficiency advantage that Mamba originally had. To eliminate these extra scans, we introduce LBMamba, a locally bi-directional SSM block that embeds a lightweight locally backward scan inside the forward scan and executes it in per-thread registers. Building on LBMamba, we present LBVim, a backbone that…
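The idea of a forward scan with a local backward scan embedded in it can be illustrated with a toy scalar recurrence. The sketch below is not the paper's kernel: the function names, the scalar coefficients `a` and `b`, the `window` parameter, and the way the two scans are combined (adding them and subtracting the current input once) are all assumptions made for illustration, and plain Python loops stand in for the per-thread register execution the abstract describes.

```python
def forward_scan(a, b):
    """Global forward recurrence h[t] = a[t] * h[t-1] + b[t], starting from 0."""
    h, out = 0.0, []
    for a_t, b_t in zip(a, b):
        h = a_t * h + b_t
        out.append(h)
    return out


def local_backward_scan(a, b, window):
    """Backward recurrence g[t] = a[t] * g[t+1] + b[t], restarted in every window."""
    out = [0.0] * len(b)
    for start in range(0, len(b), window):
        g = 0.0
        end = min(start + window, len(b))
        for t in range(end - 1, start - 1, -1):
            g = a[t] * g + b[t]
            out[t] = g
    return out


def locally_bidirectional_scan(a, b, window):
    """Combine both scans (hypothetical rule: add them, count b[t] only once)."""
    fwd = forward_scan(a, b)
    bwd = local_backward_scan(a, b, window)
    return [f + g - x for f, g, x in zip(fwd, bwd, b)]


a = [0.5, 0.5, 0.5, 0.5]
b = [1.0, 2.0, 3.0, 4.0]
print(forward_scan(a, b))                   # [1.0, 2.5, 4.25, 6.125]
print(local_backward_scan(a, b, 2))         # [2.0, 2.0, 5.0, 4.0]
print(locally_bidirectional_scan(a, b, 2))  # [2.0, 2.5, 6.25, 6.125]
```

In this toy version, position 0 picks up information from position 1 (its window neighbor) but not from positions 2 and 3, which is what "locally" bi-directional means here. The forward scan alone never sees later inputs at all.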
Taxonomy
Topics: Advanced Neural Network Applications · Human Pose and Action Recognition · EEG and Brain-Computer Interfaces
