CDDFuse: Correlation-Driven Dual-Branch Feature Decomposition for Multi-Modality Image Fusion
Zixiang Zhao, Haowen Bai, Jiangshe Zhang, Yulun Zhang, Shuang Xu, Zudi, Lin, Radu Timofte, Luc Van Gool

TL;DR
CDDFuse introduces a novel multi-modality image fusion network that effectively decomposes and fuses cross-modality features using correlation-driven loss and dual-branch Transformer-CNN architecture, improving performance in various fusion tasks.
Contribution
The paper presents a new correlation-driven dual-branch Transformer-CNN network for multi-modality image fusion, effectively modeling cross-modality features and decomposing shared and specific features.
Findings
Achieves state-of-the-art results in infrared-visible and medical image fusion.
Enhances downstream tasks like semantic segmentation and object detection.
Demonstrates robustness across multiple fusion benchmarks.
Abstract
Multi-modality (MM) image fusion aims to render fused images that maintain the merits of different modalities, e.g., functional highlight and detailed textures. To tackle the challenge in modeling cross-modality features and decomposing desirable modality-specific and modality-shared features, we propose a novel Correlation-Driven feature Decomposition Fusion (CDDFuse) network. Firstly, CDDFuse uses Restormer blocks to extract cross-modality shallow features. We then introduce a dual-branch Transformer-CNN feature extractor with Lite Transformer (LT) blocks leveraging long-range attention to handle low-frequency global features and Invertible Neural Networks (INN) blocks focusing on extracting high-frequency local information. A correlation-driven loss is further proposed to make the low-frequency features correlated while the high-frequency features uncorrelated based on the embedded…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Fusion Techniques · Remote-Sensing Image Classification · Photoacoustic and Ultrasonic Imaging
MethodsAttention Is All You Need · Layer Normalization · Softmax · Adam · Dropout · Byte Pair Encoding · Position-Wise Feed-Forward Layer · Label Smoothing · Absolute Position Encodings · Linear Layer
