A Contrastive Knowledge Transfer Framework for Model Compression and Transfer Learning
Kaiqi Zhao, Yitao Chen, Ming Zhao

TL;DR
This paper introduces CKTF, a contrastive learning framework that transfers structural knowledge from teacher to student models, significantly improving performance in model compression and transfer learning tasks.
Contribution
The paper proposes a novel contrastive knowledge transfer framework that captures intermediate structural knowledge, generalizes existing methods, and enhances model compression and transfer learning performance.
Findings
CKTF outperforms existing KT methods by up to 11.59% in model compression.
CKTF improves transfer learning accuracy by up to 4.75%.
The framework effectively transfers structural knowledge via contrastive objectives.
Abstract
Knowledge Transfer (KT) achieves competitive performance and is widely used for image classification tasks in model compression and transfer learning. Existing KT works transfer the information from a large model ("teacher") to train a small model ("student") by minimizing the difference of their conditionally independent output distributions. However, these works overlook the high-dimension structural knowledge from the intermediate representations of the teacher, which leads to limited effectiveness, and they are motivated by various heuristic intuitions, which makes it difficult to generalize. This paper proposes a novel Contrastive Knowledge Transfer Framework (CKTF), which enables the transfer of sufficient structural knowledge from the teacher to the student by optimizing multiple contrastive objectives across the intermediate representations between them. Also, CKTF provides a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Flow Measurement and Analysis · Advanced Image and Video Retrieval Techniques
