IP-SAM: Prompt-Space Conditioning for Prompt-Absent Camouflaged Object Detection
Huiyao Zhang, Jin Bai, Rui Guo, JianWen Tan, HongFei Wang, Ye Li

TL;DR
IP-SAM introduces a prompt-space conditioning approach for camouflaged object detection, enabling fully automatic segmentation with state-of-the-art results without external prompts.
Contribution
The paper proposes a novel prompt-space conditioning method with intrinsic prompts and gating, improving automatic segmentation performance and generalizing to medical image segmentation.
Findings
Achieves state-of-the-art performance on four camouflaged object detection benchmarks.
Maintains high accuracy with only 21.26M trainable parameters.
Demonstrates strong zero-shot transfer to medical polyp segmentation.
Abstract
Prompt-conditioned foundation segmenters have emerged as a dominant paradigm for image segmentation, where explicit spatial prompts (e.g., points, boxes, masks) guide mask decoding. However, many real-world deployments require fully automatic segmentation, creating a structural mismatch: the decoder expects prompts that are unavailable at inference. Existing adaptations typically modify intermediate features, inadvertently bypassing the model's native prompt interface and weakening prompt-conditioned decoding. We propose IP-SAM, which revisits adaptation from a prompt-space perspective through prompt-space conditioning. Specifically, a Self-Prompt Generator (SPG) distills image context into complementary intrinsic prompts that serve as coarse regional anchors. These cues are projected through SAM2's frozen prompt encoder, restoring prompt-guided decoding without external intervention.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
