Multi-Teacher Knowledge Distillation with Reinforcement Learning for   Visual Recognition

Chuanguang Yang; Xinqiang Yu; Han Yang; Zhulin An; Chengqing Yu; Libo; Huang; Yongjun Xu

arXiv:2502.18510·cs.CV·February 27, 2025

Multi-Teacher Knowledge Distillation with Reinforcement Learning for Visual Recognition

Chuanguang Yang, Xinqiang Yu, Han Yang, Zhulin An, Chengqing Yu, Libo, Huang, Yongjun Xu

PDF

Open Access 1 Video

TL;DR

This paper introduces a reinforcement learning-based method for multi-teacher knowledge distillation that dynamically optimizes teacher weights, leading to improved performance across various visual recognition tasks.

Contribution

It proposes a novel RL framework that considers teacher performance and gaps to adaptively assign weights, enhancing multi-teacher knowledge distillation effectiveness.

Findings

01

Achieves state-of-the-art results on image classification, object detection, and segmentation.

02

Demonstrates superior weight optimization over existing methods.

03

Improves student network performance through adaptive teacher weighting.

Abstract

Multi-teacher Knowledge Distillation (KD) transfers diverse knowledge from a teacher pool to a student network. The core problem of multi-teacher KD is how to balance distillation strengths among various teachers. Most existing methods often develop weighting strategies from an individual perspective of teacher performance or teacher-student gaps, lacking comprehensive information for guidance. This paper proposes Multi-Teacher Knowledge Distillation with Reinforcement Learning (MTKD-RL) to optimize multi-teacher weights. In this framework, we construct both teacher performance and teacher-student gaps as state information to an agent. The agent outputs the teacher weight and can be updated by the return reward from the student. MTKD-RL reinforces the interaction between the student and teacher using an agent in an RL-based decision mechanism, achieving better matching capability with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Multi-Teacher Knowledge Distillation with Reinforcement Learning for Visual Recognition· underline

Taxonomy

TopicsNeural Networks and Reservoir Computing · Multimodal Machine Learning Applications · Advanced Neural Network Applications