Elevating Defenses: Bridging Adversarial Training and Watermarking for   Model Resilience

Janvi Thakkar; Giulio Zizzo; Sergio Maffeis

arXiv:2312.14260·cs.LG·January 9, 2024·1 cites

Elevating Defenses: Bridging Adversarial Training and Watermarking for Model Resilience

Janvi Thakkar, Giulio Zizzo, Sergio Maffeis

PDF

Open Access

TL;DR

This paper presents a novel framework that combines adversarial training and watermarking to enhance model robustness and ownership verification, effectively defending against evasion and model theft attacks.

Contribution

It introduces a new method integrating adversarial training with watermarking by using different perturbation budgets, improving resilience without conflict.

Findings

01

Outperforms baseline in robustness against attacks

02

Resilient to pruning and fine-tuning removal

03

Effective on MNIST and Fashion-MNIST datasets

Abstract

Machine learning models are being used in an increasing number of critical applications; thus, securing their integrity and ownership is critical. Recent studies observed that adversarial training and watermarking have a conflicting interaction. This work introduces a novel framework to integrate adversarial training with watermarking techniques to fortify against evasion attacks and provide confident model verification in case of intellectual property theft. We use adversarial training together with adversarial watermarks to train a robust watermarked model. The key intuition is to use a higher perturbation budget to generate adversarial watermarks compared to the budget used for adversarial training, thus avoiding conflict. We use the MNIST and Fashion-MNIST datasets to evaluate our proposed technique on various model stealing attacks. The results obtained consistently outperform the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Digital Media Forensic Detection · Advanced Malware Detection Techniques

MethodsPruning