Boosting Contrastive Learning with Relation Knowledge Distillation
Kai Zheng, Yuanjiang Wang, Ye Yuan

TL;DR
This paper introduces a relation-wise contrastive learning approach with relation knowledge distillation to improve lightweight self-supervised models, significantly narrowing the performance gap with supervised methods.
Contribution
It proposes a novel relation-wise contrastive paradigm with a heterogeneous teacher for relation knowledge transfer, addressing semantic collapse in lightweight models.
Findings
Achieves a linear evaluation accuracy of 50.1% on AlexNet, close to supervised performance.
Demonstrates significant improvements across multiple lightweight models.
Provides theoretical analysis supporting the effectiveness of relation-wise contrastive learning.
Abstract
While self-supervised representation learning (SSL) has proved to be effective in the large model, there is still a huge gap between the SSL and supervised method in the lightweight model when following the same solution. We delve into this problem and find that the lightweight model is prone to collapse in semantic space when simply performing instance-wise contrast. To address this issue, we propose a relation-wise contrastive paradigm with Relation Knowledge Distillation (ReKD). We introduce a heterogeneous teacher to explicitly mine the semantic information and transferring a novel relation knowledge to the student (lightweight model). The theoretical analysis supports our main concern about instance-wise contrast and verify the effectiveness of our relation-wise contrastive learning. Extensive experimental results also demonstrate that our method achieves significant improvements…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Human Pose and Action Recognition
MethodsKnowledge Distillation
