Unleashing the Power of Generic Segmentation Models: A Simple Baseline   for Infrared Small Target Detection

Mingjin Zhang; Chi Zhang; Qiming Zhang; Yunsong Li; Xinbo Gao; Jing; Zhang

arXiv:2409.04714·cs.CV·September 10, 2024

Unleashing the Power of Generic Segmentation Models: A Simple Baseline for Infrared Small Target Detection

Mingjin Zhang, Chi Zhang, Qiming Zhang, Yunsong Li, Xinbo Gao, Jing, Zhang

PDF

1 Repo

TL;DR

This paper explores adapting generic segmentation models like SAM for infrared small target detection, proposing a lightweight baseline that, with distillation and novel query design, surpasses state-of-the-art methods in accuracy and efficiency.

Contribution

It introduces a simple, effective baseline model for IRSTD that leverages distillation and multi-scale query design to outperform existing specialized methods.

Findings

01

Our model surpasses SAM and Semantic-SAM by over 14 IoU on NUDT.

02

It outperforms state-of-the-art IRSTD methods in accuracy.

03

The approach improves both accuracy and throughput across datasets.

Abstract

Recent advancements in deep learning have greatly advanced the field of infrared small object detection (IRSTD). Despite their remarkable success, a notable gap persists between these IRSTD methods and generic segmentation approaches in natural image domains. This gap primarily arises from the significant modality differences and the limited availability of infrared data. In this study, we aim to bridge this divergence by investigating the adaptation of generic segmentation models, such as the Segment Anything Model (SAM), to IRSTD tasks. Our investigation reveals that many generic segmentation models can achieve comparable performance to state-of-the-art IRSTD methods. However, their full potential in IRSTD remains untapped. To address this, we propose a simple, lightweight, yet effective baseline model for segmenting small infrared objects. Through appropriate distillation strategies,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

o937-blip/simir
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSegment Anything Model