SFFR: Spatial-Frequency Feature Reconstruction for Multispectral Aerial Object Detection
Xin Zuo, Chenyu Qu, Haibo Zhan, Jifeng Shen, Wankou Yang

TL;DR
This paper introduces SFFR, a novel multispectral object detection method that leverages spatial and frequency domain feature reconstruction using KAN modules, improving robustness and accuracy in UAV imagery.
Contribution
The paper proposes a new frequency component exchange strategy and multi-scale Gaussian KAN modules for enhanced feature fusion in multispectral UAV object detection.
Findings
Outperforms existing methods on SeaDroneSee, DroneVehicle, and DVTOD datasets.
Effectively captures spatial and frequency features for better detection.
Demonstrates robustness to scale variations and multispectral data challenges.
Abstract
Recent multispectral object detection methods have primarily focused on spatial-domain feature fusion based on CNNs or Transformers, while the potential of frequency-domain feature remains underexplored. In this work, we propose a novel Spatial and Frequency Feature Reconstruction method (SFFR) method, which leverages the spatial-frequency feature representation mechanisms of the Kolmogorov-Arnold Network (KAN) to reconstruct complementary representations in both spatial and frequency domains prior to feature fusion. The core components of SFFR are the proposed Frequency Component Exchange KAN (FCEKAN) module and Multi-Scale Gaussian KAN (MSGKAN) module. The FCEKAN introduces an innovative selective frequency component exchange strategy that effectively enhances the complementarity and consistency of cross-modal features based on the frequency feature of RGB and IR images. The MSGKAN…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Remote-Sensing Image Classification · Advanced Image Fusion Techniques
