Deep Fourier-embedded Network for RGB and Thermal Salient Object Detection
Pengfei Lyu, Xiaosheng Yu, Pak-Hei Yeung, Chengdong Wu, Jagath C. Rajapakse

TL;DR
This paper introduces FreqSal, a Fourier Transform-based RGB-T salient object detection model that is more efficient and accurate than existing Transformer-based methods, especially for high-resolution images.
Contribution
The paper proposes a novel Fourier Transform-based model with three key components and a frequency-aware loss, addressing memory and frequency gap issues in RGB-T SOD.
Findings
Outperforms 29 state-of-the-art models on 10 benchmarks
Achieves higher accuracy with lower memory usage
Validates effectiveness through extensive ablation studies
Abstract
The rapid development of deep learning has significantly improved salient object detection (SOD) combining both RGB and thermal (RGB-T) images. However, existing Transformer-based RGB-T SOD models with quadratic complexity are memory-intensive, limiting their application in high-resolution bimodal feature fusion. To overcome this limitation, we propose a purely Fourier Transform-based model, namely Deep Fourier-embedded Network (FreqSal), for accurate RGB-T SOD. Specifically, we leverage the efficiency of Fast Fourier Transform with linear complexity to design three key components: (1) To fuse RGB and thermal modalities, we propose Modal-coordinated Perception Attention, which aligns and enhances bimodal Fourier representation in multiple dimensions; (2) To clarify object edges and suppress noise, we design Frequency-decomposed Edge-aware Block, which deeply decomposes and filters…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Advanced Image Fusion Techniques · Infrared Target Detection Methodologies
MethodsSoftmax · Attention Is All You Need · Features Explanation Method
