TL;DR
The paper introduces RPFNet, a novel image fusion network that combines residual prior extraction, frequency domain fusion, and bidirectional feature interaction to improve the quality of fused images for high-level vision tasks.
Contribution
RPFNet innovatively integrates residual prior modules, frequency-aware fusion, and cross-promotion mechanisms with specialized loss functions for superior image fusion performance.
Findings
Enhanced texture details and salient object preservation
Effective global and local feature integration
Validated superior performance on multiple datasets
Abstract
Image fusion aims to integrate complementary information across modalities to generate high-quality fused images, thereby enhancing the performance of high-level vision tasks. While global spatial modeling mechanisms show promising results, constructing long-range feature dependencies in the spatial domain incurs substantial computational costs. Additionally, the absence of ground-truth exacerbates the difficulty of capturing complementary features effectively. To tackle these challenges, we propose a Residual Prior-driven Frequency-aware Network, termed as RPFNet. Specifically, RPFNet employs a dual-branch feature extraction framework: the Residual Prior Module (RPM) extracts modality-specific difference information from residual maps, thereby providing complementary priors for fusion; the Frequency Domain Fusion Module (FDFM) achieves efficient global feature modeling and integration…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
