Reconstructing Test Labels from Noisy Loss Functions
Abhinav Aggarwal, Shiva Prasad Kasiviswanathan, Zekun Xu, Oluwaseyi, Feyisetan, Nathanael Teissier

TL;DR
This paper investigates the conditions under which private dataset labels can be reconstructed from noisy loss function values, revealing vulnerabilities in common loss functions and proposing effective label inference attacks.
Contribution
It provides a formal analysis of label inference possibilities from noisy loss functions, including conditions, complexity, and attack methods applicable to modern ML models.
Findings
Label inference is possible for many loss functions even with noise.
Designing adversarial prediction vectors is co-NP-hard, but attacks are still feasible.
Attacks can be integrated into neural networks to evade detection.
Abstract
Machine learning classifiers rely on loss functions for performance evaluation, often on a private (hidden) dataset. In a recent line of research, label inference was introduced as the problem of reconstructing the ground truth labels of this private dataset from just the (possibly perturbed) cross-entropy loss function values evaluated at chosen prediction vectors (without any other access to the hidden dataset). In this paper, we formally study the necessary and sufficient conditions under which label inference is possible from \emph{any} (noisy) loss function value. Using tools from analytical number theory, we show that a broad class of commonly used loss functions, including general Bregman divergence-based losses and multiclass cross-entropy with common activation functions like sigmoid and softmax, it is possible to design label inference attacks that succeed even for arbitrary…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Privacy-Preserving Technologies in Data · Cryptography and Data Security
