UTrace: Poisoning Forensics for Private Collaborative Learning
Evan Rose, Hidde Lycklama, Harsh Chaudhari, Niklas Britz, Anwar Hithnawi, Alina Oprea

TL;DR
UTrace is a novel framework that enables privacy-preserving machine learning systems to trace and attribute model poisoning attacks to specific users without compromising data privacy.
Contribution
UTrace introduces a user-level traceback framework combining gradient similarity and unlearning techniques for accountability in PPML.
Findings
High detection accuracy across multiple poisoning attacks
Low false positive rates in identifying malicious users
Effective on diverse datasets including vision, text, and malware
Abstract
Privacy-preserving machine learning (PPML) systems enable multiple data owners to collaboratively train models without revealing their raw, sensitive data by leveraging cryptographic protocols such as secure multi-party computation (MPC). While PPML offers strong privacy guarantees, it also introduces new attack surfaces: malicious data owners can inject poisoned data into the training process without being detected, thus undermining the integrity of the learned model. Although recent defenses, such as private input validation within MPC, can mitigate some specific poisoning strategies, they remain insufficient, particularly in preventing stealthy or distributed attacks. As the robustness of PPML remains an open challenge, strengthening trust in these systems increasingly necessitates post-hoc auditing mechanisms that instill accountability. In this paper we present UTrace, a framework…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInnovative Teaching and Learning Methods · Problem and Project Based Learning · Wikis in Education and Collaboration
MethodsSparse Evolutionary Training
