Breaking Self-Attention Failure: Rethinking Query Initialization for Infrared Small Target Detection
Yuteng Liu, Duanni Meng, Maoxun Yuan, Xingxing Wei

TL;DR
This paper introduces SEF-DETR, a novel framework that improves infrared small target detection by refining query initialization through frequency-guided patch screening, dynamic embedding enhancement, and reliability-aware fusion, addressing self-attention limitations.
Contribution
The paper proposes SEF-DETR, a new method that enhances IRSTD by refining query initialization to overcome self-attention failures, with modules for frequency-based filtering and multi-scale embedding enhancement.
Findings
SEF-DETR outperforms existing methods on three IRSTD datasets.
The proposed modules effectively suppress background noise and improve target localization.
SEF-DETR demonstrates robustness and efficiency in infrared small target detection.
Abstract
Infrared small target detection (IRSTD) faces significant challenges due to the low signal-to-noise ratio (SNR), small target size, and complex cluttered backgrounds. Although recent DETR-based detectors benefit from global context modeling, they exhibit notable performance degradation on IRSTD. We revisit this phenomenon and reveal that the target-relevant embeddings of IRST are inevitably overwhelmed by dominant background features due to the self-attention mechanism, leading to unreliable query initialization and inaccurate target localization. To address this issue, we propose SEF-DETR, a novel framework that refines query initialization for IRSTD. Specifically, SEF-DETR consists of three components: Frequency-guided Patch Screening (FPS), Dynamic Embedding Enhancement (DEE), and Reliability-Consistency-aware Fusion (RCF). The FPS module leverages the Fourier spectrum of local…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInfrared Target Detection Methodologies · Advanced Neural Network Applications · Remote-Sensing Image Classification
