Reinforcement Learning-Based Black-Box Model Inversion Attacks

Gyojin Han; Jaehyun Choi; Haeil Lee; Junmo Kim

arXiv:2304.04625·cs.LG·April 11, 2023·1 cites

Reinforcement Learning-Based Black-Box Model Inversion Attacks

Gyojin Han, Jaehyun Choi, Haeil Lee, Junmo Kim

PDF

Open Access 1 Repo

TL;DR

This paper introduces a reinforcement learning-based black-box model inversion attack that effectively reconstructs private data from models, outperforming existing methods and emphasizing the need for privacy-preserving techniques.

Contribution

It formulates the inversion attack as an MDP and uses reinforcement learning to improve attack success in black-box settings, achieving state-of-the-art results.

Findings

01

Successfully reconstructs private data across various datasets and models.

02

Outperforms existing black-box inversion attack methods.

03

Highlights the importance of privacy-preserving machine learning.

Abstract

Model inversion attacks are a type of privacy attack that reconstructs private data used to train a machine learning model, solely by accessing the model. Recently, white-box model inversion attacks leveraging Generative Adversarial Networks (GANs) to distill knowledge from public datasets have been receiving great attention because of their excellent attack performance. On the other hand, current black-box model inversion attacks that utilize GANs suffer from issues such as being unable to guarantee the completion of the attack process within a predetermined number of query accesses or achieve the same level of performance as white-box attacks. To overcome these limitations, we propose a reinforcement learning-based black-box model inversion attack. We formulate the latent space search as a Markov Decision Process (MDP) problem and solve it with reinforcement learning. Our method…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

HanGyojin/RLB-MI
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Adversarial Robustness in Machine Learning · Machine Learning in Healthcare