AdaptGuard: Defending Against Universal Attacks for Model Adaptation

Lijun Sheng; Jian Liang; Ran He; Zilei Wang; Tieniu Tan

arXiv:2303.10594·cs.CR·November 28, 2023·1 cites

AdaptGuard: Defending Against Universal Attacks for Model Adaptation

Lijun Sheng, Jian Liang, Ran He, Zilei Wang, Tieniu Tan

PDF

Open Access 1 Repo

TL;DR

This paper introduces AdaptGuard, a plug-and-play framework that enhances the security of model adaptation against universal adversarial and backdoor attacks, ensuring robustness without needing robust source models.

Contribution

AdaptGuard is a novel, easy-to-integrate method that defends against universal attacks during model adaptation without requiring changes to existing algorithms or robust source models.

Findings

01

Effectively defends against universal attacks in model adaptation

02

Maintains high accuracy on clean target data

03

Validated on multiple datasets and adaptation methods

Abstract

Model adaptation aims at solving the domain transfer problem under the constraint of only accessing the pretrained source models. With the increasing considerations of data privacy and transmission efficiency, this paradigm has been gaining recent popularity. This paper studies the vulnerability to universal attacks transferred from the source domain during model adaptation algorithms due to the existence of malicious providers. We explore both universal adversarial perturbations and backdoor attacks as loopholes on the source side and discover that they still survive in the target models after adaptation. To address this issue, we propose a model preprocessing framework, named AdaptGuard, to improve the security of model adaptation algorithms. AdaptGuard avoids direct use of the risky source parameters through knowledge distillation and utilizes the pseudo adversarial samples under…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

tomsheng21/adaptguard
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCardiac Arrest and Resuscitation

MethodsKnowledge Distillation