Black-Box Privacy Attacks on Shared Representations in Multitask Learning

John Abascal; Nicol\'as Berrios; Alina Oprea; Jonathan Ullman; Adam Smith; Matthew Jagielski

arXiv:2506.16460·cs.LG·June 23, 2025

Black-Box Privacy Attacks on Shared Representations in Multitask Learning

John Abascal, Nicol\'as Berrios, Alina Oprea, Jonathan Ullman, Adam Smith, Matthew Jagielski

PDF

Open Access 3 Reviews

TL;DR

This paper reveals that shared representations in multitask learning can unintentionally leak sensitive task information through black-box inference attacks, even without access to training data, highlighting privacy vulnerabilities.

Contribution

It introduces a novel black-box attack model for inferring task membership from shared representations in multitask learning, with theoretical and empirical validation across domains.

Findings

01

Black-box attacks can successfully infer task membership from shared representations.

02

Even with only fresh samples, adversaries can breach privacy without access to training data.

03

Theoretical analysis shows a clear separation between training and fresh sample adversaries.

Abstract

Multitask learning (MTL) has emerged as a powerful paradigm that leverages similarities among multiple learning tasks, each with insufficient samples to train a standalone model, to solve them simultaneously while minimizing data sharing across users and organizations. MTL typically accomplishes this goal by learning a shared representation that captures common structure among the tasks by embedding data from all tasks into a common feature space. Despite being designed to be the smallest unit of shared information necessary to effectively learn patterns across multiple tasks, these shared representations can inadvertently leak sensitive information about the particular tasks they were trained on. In this work, we investigate what information is revealed by the shared representations through the lens of inference attacks. Towards this, we propose a novel, black-box task-inference…

Peer Reviews

Decision·ICLR 2026 Poster

Reviewer 01Rating 4Confidence 4

Strengths

- The paper introduces a new threat model that examines how shared representations in multi-task learning can lead to privacy leakage. - It presents a black-box attack formulated under both weak and strong adversary assumptions, which makes the study applicable to real-world conditions. - A theoretical discussion is provided to explain the connection between representation sharing and privacy risks, highlighting the main factors that affect the effectiveness of the attack.

Weaknesses

- There is a noticeable difference in performance between the weak and strong adversary scenarios. - The paper does not explore or evaluate any potential defense strategies against the proposed attack. - The method used to determine the attack’s decision threshold seems ad hoc, with limited justification or analysis of how it might perform in more realistic environments.

Reviewer 02Rating 6Confidence 4

Strengths

- This paper presents a novel and highly relevant threat model, "task-inference," which generalizes sample-level membership inference to the task level. This is a more practical and realistic threat for collaborative learning paradigms like MTL and federated learning, where the unit of privacy is often an entire user or data silo. - The proposed attacks are efficient and operate under minimal adversarial assumptions. A significant strength is that they are purely black-box and do not require tra

Weaknesses

- The paper does not clearly state whether the primary experimental results (e.g., in Figure 1 and 2) are averaged over multiple independent MTL training runs. While ablations mention multiple runs, the stability of the main attack results against different model initializations is not explicitly confirmed. - The practical utility of the "blind" percentile-based thresholds is questionable. As shown in Table 1, these thresholds can result in very high false positive rates (e.g., 47.6% or 41.7%),

Reviewer 03Rating 6Confidence 2

Strengths

* The paper instroduces a novel task-inference threat model that generalizes the sample-level membership inference to the task level. * The attachs presented a black box and is not limited by the requirement of shadow models nor labeled reference data. * The empirical evaluations span both vision and language areas and use multiple datasets (CelebA, FEMNIST, StackOverflow) and MTL two common MTL cases (personalization and multi-problem learning). * The attack algorithms are clearly described

Weaknesses

* It is not clear how this novel attack compares to existing relevant baselines in these inference attachks (such as potentially those requiring stronger assumptions for the adversary).

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Privacy-Preserving Technologies in Data · Domain Adaptation and Few-Shot Learning