Retro: Reusing teacher projection head for efficient embedding   distillation on Lightweight Models via Self-supervised Learning

Khanh-Binh Nguyen; Chae Jung Park

arXiv:2405.15311·cs.CV·August 27, 2024

Retro: Reusing teacher projection head for efficient embedding distillation on Lightweight Models via Self-supervised Learning

Khanh-Binh Nguyen, Chae Jung Park

PDF

Open Access 1 Repo

TL;DR

This paper introduces extsc{Retro}, a method that reuses the teacher's projection head for efficient self-supervised embedding distillation in lightweight models, achieving superior performance with fewer parameters.

Contribution

extsc{Retro} is a novel approach that reuses the teacher's projection head, improving lightweight model distillation in self-supervised learning.

Findings

01

Significant performance improvements on ImageNet with lightweight models.

02

Efficient distillation with fewer parameters.

03

Outperforms state-of-the-art on all tested models.

Abstract

Self-supervised learning (SSL) is gaining attention for its ability to learn effective representations with large amounts of unlabeled data. Lightweight models can be distilled from larger self-supervised pre-trained models using contrastive and consistency constraints. Still, the different sizes of the projection heads make it challenging for students to mimic the teacher's embedding accurately. We propose \textsc{Retro}, which reuses the teacher's projection head for students, and our experimental results demonstrate significant improvements over the state-of-the-art on all lightweight models. For instance, when training EfficientNet-B0 using ResNet-50/101/152 as teachers, our approach improves the linear result on ImageNet to $66.9%$ , $69.3%$ , and $69.8%$ , respectively, with significantly fewer parameters.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

beandkay/epass
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications