Membership Inference Attacks by Exploiting Loss Trajectory

Yiyong Liu; Zhengyu Zhao; Michael Backes; Yang Zhang

arXiv:2208.14933·cs.CR·September 1, 2022

Membership Inference Attacks by Exploiting Loss Trajectory

Yiyong Liu, Zhengyu Zhao, Michael Backes, Yang Zhang

PDF

Open Access 1 Repo

TL;DR

This paper introduces extsystem, a novel membership inference attack leveraging the entire training loss trajectory, significantly improving attack success over existing methods especially when losses are similar.

Contribution

The paper proposes a new attack method that exploits the full training process, using knowledge distillation to access intermediate model losses for better membership inference.

Findings

01

Achieves at least 6× higher true-positive rate on CINIC-10.

02

Effective across different datasets and model architectures.

03

Outperforms existing methods in low false-positive scenarios.

Abstract

Machine learning models are vulnerable to membership inference attacks in which an adversary aims to predict whether or not a particular sample was contained in the target model's training dataset. Existing attack methods have commonly exploited the output information (mostly, losses) solely from the given target model. As a result, in practical scenarios where both the member and non-member samples yield similarly small losses, these methods are naturally unable to differentiate between them. To address this limitation, in this paper, we propose a new attack method, called \system, which can exploit the membership information from the whole training process of the target model for improving the attack performance. To mount the attack in the common black-box setting, we leverage knowledge distillation, and represent the membership information by the losses evaluated on a sequence of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

dennisliu2022/membership-inference-attacks-by-exploiting-loss-trajectory
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Network Security and Intrusion Detection