Soft Knowledge Distillation with Multi-Dimensional Cross-Net Attention for Image Restoration Models Compression
Yongheng Zhang, and Danfeng Yan

TL;DR
This paper introduces a novel soft knowledge distillation method with multi-dimensional cross-net attention for compressing image restoration models, effectively reducing complexity while preserving high restoration quality.
Contribution
It proposes a multi-dimensional attention mechanism and a contrastive loss to improve knowledge distillation in image restoration models, addressing implicit attention relationships.
Findings
Significant reduction in FLOPs and parameters.
Maintained high restoration performance across tasks.
Enhanced feature learning stability and efficiency.
Abstract
Transformer-based encoder-decoder models have achieved remarkable success in image-to-image transfer tasks, particularly in image restoration. However, their high computational complexity-manifested in elevated FLOPs and parameter counts-limits their application in real-world scenarios. Existing knowledge distillation methods in image restoration typically employ lightweight student models that directly mimic the intermediate features and reconstruction results of the teacher, overlooking the implicit attention relationships between them. To address this, we propose a Soft Knowledge Distillation (SKD) strategy that incorporates a Multi-dimensional Cross-net Attention (MCA) mechanism for compressing image restoration models. This mechanism facilitates interaction between the student and teacher across both channel and spatial dimensions, enabling the student to implicitly learn the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Processing Techniques · Medical Image Segmentation Techniques · Image Processing Techniques and Applications
MethodsSoftmax · Attention Is All You Need · Contrastive Learning · Knowledge Distillation
