Text-Guided Channel Perturbation and Pretrained Knowledge Integration for Unified Multi-Modality Image Fusion
Xilai Li, Xiaosong Li, Weijun Jiang

TL;DR
This paper introduces a unified multi-modality image fusion framework that leverages channel perturbation and pre-trained knowledge to improve fusion quality and generalization across different modalities.
Contribution
It proposes novel modules like SCPM, GAM, and TCPM to enhance feature discrimination and reduce modality dependence, advancing multi-modality image fusion techniques.
Findings
Outperforms existing methods on multi-modality image fusion tasks.
Enhances feature discrimination using semantic-aware channel pruning.
Reduces modality dependence through text-guided channel perturbation.
Abstract
Multi-modality image fusion enhances scene perception by combining complementary information. Unified models aim to share parameters across modalities for multi-modality image fusion, but large modality differences often cause gradient conflicts, limiting performance. Some methods introduce modality-specific encoders to enhance feature perception and improve fusion quality. However, this strategy reduces generalisation across different fusion tasks. To overcome this limitation, we propose a unified multi-modality image fusion framework based on channel perturbation and pre-trained knowledge integration (UP-Fusion). To suppress redundant modal information and emphasize key features, we propose the Semantic-Aware Channel Pruning Module (SCPM), which leverages the semantic perception capability of a pre-trained model to filter and enhance multi-modality feature channels. Furthermore, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Fusion Techniques · Image Enhancement Techniques · Advanced Image Processing Techniques
