IP-SAM: Prompt-Space Conditioning for Prompt-Absent Camouflaged Object Detection

Huiyao Zhang; Jin Bai; Rui Guo; JianWen Tan; HongFei Wang; Ye Li

arXiv:2603.27250·cs.CV·March 31, 2026

IP-SAM: Prompt-Space Conditioning for Prompt-Absent Camouflaged Object Detection

Huiyao Zhang, Jin Bai, Rui Guo, JianWen Tan, HongFei Wang, Ye Li

PDF

TL;DR

IP-SAM introduces a prompt-space conditioning approach for camouflaged object detection, enabling fully automatic segmentation with state-of-the-art results without external prompts.

Contribution

The paper proposes a novel prompt-space conditioning method with intrinsic prompts and gating, improving automatic segmentation performance and generalizing to medical image segmentation.

Findings

01

Achieves state-of-the-art performance on four camouflaged object detection benchmarks.

02

Maintains high accuracy with only 21.26M trainable parameters.

03

Demonstrates strong zero-shot transfer to medical polyp segmentation.

Abstract

Prompt-conditioned foundation segmenters have emerged as a dominant paradigm for image segmentation, where explicit spatial prompts (e.g., points, boxes, masks) guide mask decoding. However, many real-world deployments require fully automatic segmentation, creating a structural mismatch: the decoder expects prompts that are unavailable at inference. Existing adaptations typically modify intermediate features, inadvertently bypassing the model's native prompt interface and weakening prompt-conditioned decoding. We propose IP-SAM, which revisits adaptation from a prompt-space perspective through prompt-space conditioning. Specifically, a Self-Prompt Generator (SPG) distills image context into complementary intrinsic prompts that serve as coarse regional anchors. These cues are projected through SAM2's frozen prompt encoder, restoring prompt-guided decoding without external intervention.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.