Harmless Backdoor-based Client-side Watermarking in Federated Learning
Kaijing Luo, Ka-Ho Chow

TL;DR
This paper introduces Sanitizer, a server-side method for secure client watermarking in federated learning that verifies ownership without enabling malicious exploitation, while significantly improving efficiency.
Contribution
The paper presents Sanitizer, a novel approach that confines backdoor watermarks to harmless environments, resolving conflicts and enhancing security in federated learning.
Findings
Achieves near-perfect client contribution verification.
Reduces GPU memory consumption by 85%.
Cuts processing time by at least 5x.
Abstract
Protecting intellectual property (IP) in federated learning (FL) is increasingly important as clients contribute proprietary data to collaboratively train models. Model watermarking, particularly through backdoor-based methods, has emerged as a popular approach for verifying ownership and contributions in deep neural networks trained via FL. By manipulating their datasets, clients can embed a secret pattern, resulting in non-intuitive predictions that serve as proof of participation, useful for claiming incentives or IP co-ownership. However, this technique faces practical challenges: (i) client watermarks can collide, leading to ambiguous ownership claims, and (ii) malicious clients may exploit watermarks to manipulate model predictions for harmful purposes. To address these issues, we propose Sanitizer, a server-side method that ensures client-embedded backdoors can only be activated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Steganography and Watermarking Techniques · Cryptography and Data Security · Privacy-Preserving Technologies in Data
