Mutual Information Guided Backdoor Mitigation for Pre-trained Encoders
Tingxu Han, Weisong Sun, Ziqi Ding, Chunrong Fang, Hanwei Qian, Jiaxun Li, Zhenyu Chen, Xiangyu Zhang

TL;DR
This paper introduces MIMIC, a mutual information guided backdoor mitigation method for pre-trained SSL encoders, which effectively reduces backdoor success rates using minimal clean data by distilling clean features from a teacher encoder.
Contribution
MIMIC is the first approach to use mutual information and knowledge distillation with random initialization to mitigate backdoors in pre-trained encoders without label data.
Findings
MIMIC significantly reduces backdoor success rates.
MIMIC outperforms seven state-of-the-art mitigation techniques.
Effective with less than 5% clean data.
Abstract
Self-supervised learning (SSL) is increasingly attractive for pre-training encoders without requiring labeled data. Downstream tasks built on top of those pre-trained encoders can achieve nearly state-of-the-art performance. The pre-trained encoders by SSL, however, are vulnerable to backdoor attacks as demonstrated by existing studies. Numerous backdoor mitigation techniques are designed for downstream task models. However, their effectiveness is impaired and limited when adapted to pre-trained encoders, due to the lack of label information when pre-training. To address backdoor attacks against pre-trained encoders, in this paper, we innovatively propose a mutual information guided backdoor mitigation technique, named MIMIC. MIMIC treats the potentially backdoored encoder as the teacher net and employs knowledge distillation to distill a clean student encoder from the teacher net.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Real-time simulation and control systems · Fault Detection and Control Systems
MethodsKnowledge Distillation
