Rethinking Representations for Cross-Domain Infrared Small Target Detection: A Generalizable Perspective from the Frequency Domain
Yimin Fu, Songbo Wang, Feiyan Wu, Jialin Lyu, Zhunga Liu, Michael K. Ng

TL;DR
This paper introduces S$^2$CPNet, a novel frequency domain approach for infrared small target detection that enhances cross-domain generalization by addressing spectral inconsistencies and domain-specific biases.
Contribution
It proposes a frequency perspective to IRSTD, including a phase rectification module and orthogonal attention, to improve generalization across unseen domains.
Findings
Achieves state-of-the-art results on three IRSTD datasets.
Effectively reduces performance degradation in cross-domain scenarios.
Demonstrates robustness against distribution shifts in observational conditions.
Abstract
The accurate target-background separation in infrared small target detection (IRSTD) highly depends on the discriminability of extracted representations. However, most existing methods are confined to domain-consistent settings, while overlooking whether such discriminability can generalize to unseen domains. In practice, distribution shifts between training and testing data are inevitable due to variations in observational conditions and environmental factors. Meanwhile, the intrinsic indistinctiveness of infrared small targets aggravates overfitting to domain-specific patterns. Consequently, the detection performance of models trained on source domains can be severely degraded when deployed in unseen domains. To address this challenge, we propose a spatial-spectral collaborative perception network (SCPNet) for cross-domain IRSTD. Moving beyond conventional spatial learning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
