Prototype-guided Cross-task Knowledge Distillation for Large-scale Models
Deng Li, Aming Wu, Yahong Han, Qi Tian

TL;DR
This paper introduces ProC-KD, a novel knowledge distillation method that transfers local object knowledge from large models to smaller ones across different tasks, enhancing generalization in diverse visual scenarios.
Contribution
The paper proposes a prototype-guided approach with modules for learning generalized features and task-adaptive augmentation, enabling effective cross-task knowledge transfer from large models.
Findings
Improved performance on various visual tasks.
Effective transfer of local object features across tasks.
Enhanced generalization of student models.
Abstract
Recently, large-scale pre-trained models have shown their advantages in many tasks. However, due to the huge computational complexity and storage requirements, it is challenging to apply the large-scale model to real scenes. A common solution is knowledge distillation which regards the large-scale model as a teacher model and helps to train a small student model to obtain a competitive performance. Cross-task Knowledge distillation expands the application scenarios of the large-scale pre-trained model. Existing knowledge distillation works focus on directly mimicking the final prediction or the intermediate layers of the teacher model, which represent the global-level characteristics and are task-specific. To alleviate the constraint of different label spaces, capturing invariant intrinsic local object characteristics (such as the shape characteristics of the leg and tail of the cattle…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Advanced Neural Network Applications · Multimodal Machine Learning Applications
MethodsKnowledge Distillation
