Towards a Smaller Student: Capacity Dynamic Distillation for Efficient Image Retrieval

Yi Xie; Huaidong Zhang; Xuemiao Xu; Jianqing Zhu; Shengfeng He

arXiv:2303.09230·cs.CV·October 6, 2023·1 cites

Open Access

TL;DR

This paper introduces a Capacity Dynamic Distillation framework that starts with a heavy student model and gradually compresses it during training, achieving faster inference and higher accuracy in image retrieval tasks.

Contribution

It proposes a novel dynamic capacity adjustment method for student models in knowledge distillation, enhancing efficiency without sacrificing performance.

Findings

01. Achieves 67.13% parameter reduction and 65.67% FLOPs reduction on the VeRi-776 dataset.

02. Outperforms state-of-the-art methods in inference speed and accuracy.

03. Achieves around 2.11% higher mAP than existing approaches.

Abstract

Previous Knowledge Distillation based efficient image retrieval methods employ a lightweight network as the student model for fast inference. However, the lightweight student model lacks adequate representation capacity for effective knowledge imitation during the most critical early training period, causing final performance degeneration. To tackle this issue, we propose a Capacity Dynamic Distillation framework, which constructs a student model with editable representation capacity. Specifically, the employed student model is initially a heavy model to fruitfully learn distilled knowledge in the early training epochs, and the student model is gradually compressed during the training. To dynamically adjust the model capacity, our dynamic framework inserts a learnable convolutional layer within each residual block in the student model as the channel importance indicator. The indicator…
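The abstract excerpt stops before it says how the channel importance indicator is turned into a compression decision, so the following is only a minimal sketch of the general idea of starting heavy and shrinking gradually. It assumes two things that are not taken from the paper: channels are ranked by the magnitude of a per-channel indicator value, and the fraction of channels kept falls linearly over training. The names `keep_ratio`, `select_channels`, and the example scores are hypothetical.

```python
import math


def keep_ratio(epoch, total_epochs, final_ratio):
    """Fraction of channels kept at a given epoch.

    Assumed schedule (not from the paper): linear decay from 1.0 at the
    first epoch to final_ratio at the last epoch.
    """
    if total_epochs <= 1:
        return final_ratio
    progress = min(max(epoch / (total_epochs - 1), 0.0), 1.0)
    return 1.0 - progress * (1.0 - final_ratio)


def select_channels(indicator, ratio):
    """Indices of the channels with the largest |indicator| values.

    Assumed criterion (not from the paper): magnitude ranking.
    At least one channel is always kept.
    """
    n_keep = max(1, math.ceil(len(indicator) * ratio))
    ranked = sorted(range(len(indicator)),
                    key=lambda i: abs(indicator[i]),
                    reverse=True)
    return sorted(ranked[:n_keep])


# Hypothetical indicator values for one residual block with 8 channels.
indicator = [0.9, -0.05, 0.4, 0.01, -0.7, 0.2, 0.03, 0.6]

for epoch in range(5):
    ratio = keep_ratio(epoch, total_epochs=5, final_ratio=0.25)
    print(epoch, ratio, select_channels(indicator, ratio))
```

In this sketch the student keeps all 8 channels at the first epoch and only the two highest-scoring channels at the last one. In a real network the indicator values would be learned jointly with the distillation loss rather than fixed as they are here.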

Taxonomy

Topics: Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications

Methods: Knowledge Distillation · Residual Connection · Batch Normalization · Convolution · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Residual Block