Robust Knowledge Distillation Based on Feature Variance Against Backdoored Teacher Model
Jinyin Chen, Xiaoming Zhao, Haibin Zheng, Xiao Li, Sheng Xiang,, Haifeng Guo

TL;DR
This paper introduces RobustKD, a knowledge distillation method that effectively compresses models while mitigating backdoor threats by focusing on feature variance, ensuring robustness and high performance across various models.
Contribution
RobustKD is a novel approach that enhances model robustness against backdoors during knowledge distillation by leveraging feature variance, while maintaining high task performance.
Findings
RobustKD achieves comparable performance to teacher models.
It effectively mitigates backdoor effects in student models.
Demonstrates robustness across multiple model architectures.
Abstract
Benefiting from well-trained deep neural networks (DNNs), model compression have captured special attention for computing resource limited equipment, especially edge devices. Knowledge distillation (KD) is one of the widely used compression techniques for edge deployment, by obtaining a lightweight student model from a well-trained teacher model released on public platforms. However, it has been empirically noticed that the backdoor in the teacher model will be transferred to the student model during the process of KD. Although numerous KD methods have been proposed, most of them focus on the distillation of a high-performing student model without robustness consideration. Besides, some research adopts KD techniques as effective backdoor mitigation tools, but they fail to perform model compression at the same time. Consequently, it is still an open problem to well achieve two objectives…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEducational Technology and Assessment
MethodsFocus · Knowledge Distillation
