Mamba Capsule Routing Towards Part-Whole Relational Camouflaged Object   Detection

Dingwen Zhang; Liangbo Cheng; Yi Liu; Xinggang Wang; Junwei Han

arXiv:2410.03987·cs.CV·October 8, 2024·2 cites

Mamba Capsule Routing Towards Part-Whole Relational Camouflaged Object Detection

Dingwen Zhang, Liangbo Cheng, Yi Liu, Xinggang Wang, Junwei Han

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel mamba capsule routing method at the type level to improve camouflaged object detection, reducing computation and parameters compared to traditional pixel-level routing, and achieves superior results on benchmark datasets.

Contribution

The paper proposes a type-level mamba capsule routing approach that enhances efficiency and effectiveness in camouflaged object detection over existing pixel-level capsule routing methods.

Findings

01

Significantly outperforms state-of-the-art methods on three COD benchmarks.

02

Reduces computation and parameters compared to EM capsule routing.

03

Effectively retrieves pixel-level features guided by type-level capsules.

Abstract

The part-whole relational property endowed by Capsule Networks (CapsNets) has been known successful for camouflaged object detection due to its segmentation integrity. However, the previous Expectation Maximization (EM) capsule routing algorithm with heavy computation and large parameters obstructs this trend. The primary attribution behind lies in the pixel-level capsule routing. Alternatively, in this paper, we propose a novel mamba capsule routing at the type level. Specifically, we first extract the implicit latent state in mamba as capsule vectors, which abstract type-level capsules from pixel-level versions. These type-level mamba capsules are fed into the EM routing algorithm to get the high-layer mamba capsules, which greatly reduce the computation and parameters caused by the pixel-level capsule routing for part-whole relationships exploration. On top of that, to retrieve the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Liangbo-Cheng/mamba_capsule
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVisual Attention and Saliency Detection · Advanced Image and Video Retrieval Techniques

MethodsMamba: Linear-Time Sequence Modeling with Selective State Spaces