Data-Efficient Ranking Distillation for Image Retrieval
Zakaria Laskar, Juho Kannala

TL;DR
This paper introduces a data-efficient knowledge distillation method for image retrieval that reduces dependency on large datasets and private models by augmenting training data in the output representation space, achieving competitive results.
Contribution
It proposes a novel distillation approach that works with limited queries, black box teachers, and small datasets without requiring same-dimensional models.
Findings
Outperforms fully supervised models in low-data scenarios
Matches baseline performance with full supervision
Effective on challenging image retrieval datasets
Abstract
Recent advances in deep learning has lead to rapid developments in the field of image retrieval. However, the best performing architectures incur significant computational cost. Recent approaches tackle this issue using knowledge distillation to transfer knowledge from a deeper and heavier architecture to a much smaller network. In this paper we address knowledge distillation for metric learning problems. Unlike previous approaches, our proposed method jointly addresses the following constraints i) limited queries to teacher model, ii) black box teacher model with access to the final output representation, and iii) small fraction of original training data without any ground-truth labels. In addition, the distillation method does not require the student and teacher to have same dimensionality. Addressing these constraints reduces computation requirements, dependency on large-scale…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsKnowledge Distillation
