Let All be Whitened: Multi-teacher Distillation for Efficient Visual   Retrieval

Zhe Ma; Jianfeng Dong; Shouling Ji; Zhenguang Liu; Xuhong Zhang,; Zonghui Wang; Sifeng He; Feng Qian; Xiaobo Zhang; Lei Yang

arXiv:2312.09716·cs.CV·December 18, 2023·1 cites

Let All be Whitened: Multi-teacher Distillation for Efficient Visual Retrieval

Zhe Ma, Jianfeng Dong, Shouling Ji, Zhenguang Liu, Xuhong Zhang,, Zonghui Wang, Sifeng He, Feng Qian, Xiaobo Zhang, Lei Yang

PDF

Open Access 1 Repo

TL;DR

This paper introduces Whiten-MTD, a multi-teacher distillation framework that improves the efficiency of visual retrieval models by transferring knowledge from pre-trained models to lightweight students through a whitening process.

Contribution

The paper proposes a novel whitening technique for multi-teacher distillation in visual retrieval, enabling effective knowledge transfer from diverse models to lightweight retrieval systems.

Findings

01

Effective on multiple datasets for image and video retrieval

02

Balances retrieval accuracy with computational efficiency

03

Outperforms baseline methods in experiments

Abstract

Visual retrieval aims to search for the most relevant visual items, e.g., images and videos, from a candidate gallery with a given query item. Accuracy and efficiency are two competing objectives in retrieval tasks. Instead of crafting a new method pursuing further improvement on accuracy, in this paper we propose a multi-teacher distillation framework Whiten-MTD, which is able to transfer knowledge from off-the-shelf pre-trained retrieval models to a lightweight student model for efficient visual retrieval. Furthermore, we discover that the similarities obtained by different retrieval models are diversified and incommensurable, which makes it challenging to jointly distill knowledge from multiple models. Therefore, we propose to whiten the output of teacher models before fusion, which enables effective multi-teacher distillation for retrieval models. Whiten-MTD is conceptually simple…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

maryeon/whiten_mtd
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning