TL;DR
This paper introduces a one-shot, training-free method for sequential infrared small target segmentation that leverages SAM's zero-shot capabilities, achieving state-of-the-art performance with minimal data.
Contribution
The authors adapt the Segment Anything Model for IRSTS using a novel confidence map and prompt-centric modules, enabling effective segmentation without training.
Findings
Achieves comparable performance to state-of-the-art methods with one shot.
Significantly outperforms other one-shot segmentation approaches.
Demonstrates robustness across annotation types and reference image choices.
Abstract
Infrared small target sequences exhibit strong similarities between frames and contain rich contextual information, which motivates us to achieve sequential infrared small target segmentation (IRSTS) with minimal data. Inspired by the success of Segment Anything Model (SAM) across various downstream tasks, we propose a one-shot and training-free method that perfectly adapts SAM's zero-shot generalization capability to sequential IRSTS. Specifically, we first obtain a confidence map through local feature matching (LFM). The highest point in the confidence map is used as the prompt to replace the manual prompt. Then, to address the over-segmentation issue caused by the domain gap, we design the point prompt-centric focusing (PPCF) module. Subsequently, to prevent miss and false detections, we introduce the triple-level ensemble (TLE) module to produce the final mask. Experiments…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
