GLiRA: Black-Box Membership Inference Attack via Knowledge Distillation

Andrey V. Galichin; Mikhail Pautov; Alexey Zhavoronkin; Oleg Y. Rogov,; Ivan Oseledets

arXiv:2405.07562·cs.LG·May 14, 2024

GLiRA: Black-Box Membership Inference Attack via Knowledge Distillation

Andrey V. Galichin, Mikhail Pautov, Alexey Zhavoronkin, Oleg Y. Rogov,, Ivan Oseledets

PDF

Open Access 1 Repo

TL;DR

This paper introduces GLiRA, a novel black-box membership inference attack leveraging knowledge distillation to significantly improve attack efficiency and outperform existing methods across various image classification models.

Contribution

We propose GLiRA, a distillation-guided approach that enhances likelihood ratio-based membership inference attacks in black-box neural networks, addressing a key privacy vulnerability.

Findings

01

GLiRA outperforms current state-of-the-art attacks in black-box settings.

02

Knowledge distillation improves the likelihood ratio attack efficiency.

03

The method is effective across multiple datasets and models.

Abstract

While Deep Neural Networks (DNNs) have demonstrated remarkable performance in tasks related to perception and control, there are still several unresolved concerns regarding the privacy of their training data, particularly in the context of vulnerability to Membership Inference Attacks (MIAs). In this paper, we explore a connection between the susceptibility to membership inference attacks and the vulnerability to distillation-based functionality stealing attacks. In particular, we propose {GLiRA}, a distillation-guided approach to membership inference attack on the black-box neural network. We observe that the knowledge distillation significantly improves the efficiency of likelihood ratio of membership inference attack, especially in the black-box setting, i.e., when the architecture of the target model is unknown to the attacker. We evaluate the proposed method across multiple image…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

AIRI-Institute/GLiRA
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning

MethodsKnowledge Distillation