Robust Knowledge Distillation Based on Feature Variance Against   Backdoored Teacher Model

Jinyin Chen; Xiaoming Zhao; Haibin Zheng; Xiao Li; Sheng Xiang,; Haifeng Guo

arXiv:2406.03409·cs.LG·June 6, 2024

Robust Knowledge Distillation Based on Feature Variance Against Backdoored Teacher Model

Jinyin Chen, Xiaoming Zhao, Haibin Zheng, Xiao Li, Sheng Xiang,, Haifeng Guo

PDF

Open Access 1 Repo

TL;DR

This paper introduces RobustKD, a knowledge distillation method that effectively compresses models while mitigating backdoor threats by focusing on feature variance, ensuring robustness and high performance across various models.

Contribution

RobustKD is a novel approach that enhances model robustness against backdoors during knowledge distillation by leveraging feature variance, while maintaining high task performance.

Findings

01

RobustKD achieves comparable performance to teacher models.

02

It effectively mitigates backdoor effects in student models.

03

Demonstrates robustness across multiple model architectures.

Abstract

Benefiting from well-trained deep neural networks (DNNs), model compression have captured special attention for computing resource limited equipment, especially edge devices. Knowledge distillation (KD) is one of the widely used compression techniques for edge deployment, by obtaining a lightweight student model from a well-trained teacher model released on public platforms. However, it has been empirically noticed that the backdoor in the teacher model will be transferred to the student model during the process of KD. Although numerous KD methods have been proposed, most of them focus on the distillation of a high-performing student model without robustness consideration. Besides, some research adopts KD techniques as effective backdoor mitigation tools, but they fail to perform model compression at the same time. Consequently, it is still an open problem to well achieve two objectives…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Xming-Z/RobustKD
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEducational Technology and Assessment

MethodsFocus · Knowledge Distillation