Reconstructing Test Labels from Noisy Loss Functions

Abhinav Aggarwal; Shiva Prasad Kasiviswanathan; Zekun Xu; Oluwaseyi; Feyisetan; Nathanael Teissier

arXiv:2107.03022·cs.LG·November 2, 2021

Reconstructing Test Labels from Noisy Loss Functions

Abhinav Aggarwal, Shiva Prasad Kasiviswanathan, Zekun Xu, Oluwaseyi, Feyisetan, Nathanael Teissier

PDF

Open Access

TL;DR

This paper investigates the conditions under which private dataset labels can be reconstructed from noisy loss function values, revealing vulnerabilities in common loss functions and proposing effective label inference attacks.

Contribution

It provides a formal analysis of label inference possibilities from noisy loss functions, including conditions, complexity, and attack methods applicable to modern ML models.

Findings

01

Label inference is possible for many loss functions even with noise.

02

Designing adversarial prediction vectors is co-NP-hard, but attacks are still feasible.

03

Attacks can be integrated into neural networks to evade detection.

Abstract

Machine learning classifiers rely on loss functions for performance evaluation, often on a private (hidden) dataset. In a recent line of research, label inference was introduced as the problem of reconstructing the ground truth labels of this private dataset from just the (possibly perturbed) cross-entropy loss function values evaluated at chosen prediction vectors (without any other access to the hidden dataset). In this paper, we formally study the necessary and sufficient conditions under which label inference is possible from \emph{any} (noisy) loss function value. Using tools from analytical number theory, we show that a broad class of commonly used loss functions, including general Bregman divergence-based losses and multiclass cross-entropy with common activation functions like sigmoid and softmax, it is possible to design label inference attacks that succeed even for arbitrary…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Privacy-Preserving Technologies in Data · Cryptography and Data Security