Hybrid Mamba for Few-Shot Segmentation
Qianxiong Xu, Xuanyi Liu, Lanyun Zhu, Guosheng Lin, Cheng Long, Ziyue, Li, Rui Zhao

TL;DR
This paper introduces HMNet, a hybrid Mamba network for few-shot segmentation that effectively captures inter-sequence dependencies with linear complexity, improving support information utilization and segmentation accuracy.
Contribution
We propose a hybrid Mamba network with support recapped and query intercepted modules to enhance support feature retention and support-query fusion in few-shot segmentation.
Findings
HMNet outperforms existing methods on two benchmarks.
The support recapped Mamba maintains rich support information.
The query intercepted Mamba improves support-query fusion.
Abstract
Many few-shot segmentation (FSS) methods use cross attention to fuse support foreground (FG) into query features, regardless of the quadratic complexity. A recent advance Mamba can also well capture intra-sequence dependencies, yet the complexity is only linear. Hence, we aim to devise a cross (attention-like) Mamba to capture inter-sequence dependencies for FSS. A simple idea is to scan on support features to selectively compress them into the hidden state, which is then used as the initial hidden state to sequentially scan query features. Nevertheless, it suffers from (1) support forgetting issue: query features will also gradually be compressed when scanning on them, so the support features in hidden state keep reducing, and many query pixels cannot fuse sufficient support features; (2) intra-class gap issue: query FG is essentially more similar to itself rather than to support FG,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications
MethodsSoftmax · Attention Is All You Need · Mamba: Linear-Time Sequence Modeling with Selective State Spaces
