Distributional Black-Box Model Inversion Attack with Multi-Agent Reinforcement Learning
Huan Bao, Kaimin Wei, Yongdong Wu, Jin Qian, Robert H. Deng

TL;DR
This paper introduces a novel black-box model inversion attack using multi-agent reinforcement learning to construct a probabilistic latent space, enabling more effective recovery of private training data without needing model details.
Contribution
It proposes a distributional MI attack that does not require model parameters or specialized GAN training, utilizing reinforcement learning to better approximate the latent data distribution.
Findings
Outperforms state-of-the-art in attack accuracy
Achieves better K-nearest neighbor feature distance
Improves Peak Signal-to-Noise Ratio
Abstract
A Model Inversion (MI) attack based on Generative Adversarial Networks (GAN) aims to recover the private training data from complex deep learning models by searching codes in the latent space. However, they merely search a deterministic latent space such that the found latent code is usually suboptimal. In addition, the existing distributional MI schemes assume that an attacker can access the structures and parameters of the target model, which is not always viable in practice. To overcome the above shortcomings, this paper proposes a novel Distributional Black-Box Model Inversion (DBB-MI) attack by constructing the probabilistic latent space for searching the target privacy data. Specifically, DBB-MI does not need the target model parameters or specialized GAN training. Instead, it finds the latent probability distribution by combining the output of the target model with multi-agent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Smart Grid Security and Resilience
