SIRST-5K: Exploring Massive Negatives Synthesis with Self-supervised Learning for Robust Infrared Small Target Detection
Yahao Lu, Yupei Lin, Han Wu, Xiaoyu Xian, Yukai Shi, Liang Lin

TL;DR
This paper introduces a negative sample augmentation method using self-supervised learning to generate a large synthetic dataset for infrared small target detection, significantly improving model performance and training efficiency.
Contribution
It proposes a novel negative augmentation strategy with realistic noise modeling to create a large, diverse synthetic dataset for enhanced infrared small target detection.
Findings
Improved detection accuracy over state-of-the-art methods
Faster convergence in training models
Enhanced dataset diversity and realism
Abstract
Single-frame infrared small target (SIRST) detection aims to recognize small targets from clutter backgrounds. Recently, convolutional neural networks have achieved significant advantages in general object detection. With the development of Transformer, the scale of SIRST models is constantly increasing. Due to the limited training samples, performance has not been improved accordingly. The quality, quantity, and diversity of the infrared dataset are critical to the detection of small targets. To highlight this issue, we propose a negative sample augmentation method in this paper. Specifically, a negative augmentation approach is proposed to generate massive negatives for self-supervised learning. Firstly, we perform a sequential noise modeling technology to generate realistic infrared data. Secondly, we fuse the extracted noise with the original data to facilitate diversity and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRemote-Sensing Image Classification · Machine Learning in Materials Science · Infrared Target Detection Methodologies
MethodsAttention Is All You Need · Linear Layer · Dropout · Multi-Head Attention · Layer Normalization · Absolute Position Encodings · Softmax · Dense Connections · Label Smoothing · Adam
