Towards a Smaller Student: Capacity Dynamic Distillation for Efficient Image Retrieval
Yi Xie, Huaidong Zhang, Xuemiao Xu, Jianqing Zhu, Shengfeng He

TL;DR
This paper introduces a Capacity Dynamic Distillation framework that starts with a heavy student model and gradually compresses it during training, achieving faster inference and higher accuracy in image retrieval tasks.
Contribution
It proposes a novel dynamic capacity adjustment method for student models in knowledge distillation, enhancing efficiency without sacrificing performance.
Findings
Achieves 67.13% parameter reduction and 65.67% FLOPs reduction on VeRi-776 dataset.
Outperforms state-of-the-art methods in inference speed and accuracy.
Maintains around 2.11% higher mAP compared to existing approaches.
Abstract
Previous Knowledge Distillation based efficient image retrieval methods employs a lightweight network as the student model for fast inference. However, the lightweight student model lacks adequate representation capacity for effective knowledge imitation during the most critical early training period, causing final performance degeneration. To tackle this issue, we propose a Capacity Dynamic Distillation framework, which constructs a student model with editable representation capacity. Specifically, the employed student model is initially a heavy model to fruitfully learn distilled knowledge in the early training epochs, and the student model is gradually compressed during the training. To dynamically adjust the model capacity, our dynamic framework inserts a learnable convolutional layer within each residual block in the student model as the channel importance indicator. The indicator…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Knowledge Distillation · Residual Connection · Batch Normalization · Convolution · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Residual Block
