TL;DR
This paper introduces a causal reasoning framework to explain membership inference attacks in neural networks, revealing their complex relationship with generalization and providing a predictive model with high accuracy.
Contribution
It is the first to use causal graphs to explain MI attacks and their link to generalization, challenging prior hypotheses and offering a predictive analytical approach.
Findings
Causal models explain MI attack performance with high accuracy (0.90).
Refutes oversimplified hypotheses about causes of MI attacks.
Reveals a causal connection between generalization and MI attacks.
Abstract
Membership inference (MI) attacks highlight a privacy weakness in present stochastic training methods for neural networks. It is not well understood, however, why they arise. Are they a natural consequence of imperfect generalization only? Which underlying causes should we address during training to mitigate these attacks? Towards answering such questions, we propose the first approach to explain MI attacks and their connection to generalization based on principled causal reasoning. We offer causal graphs that quantitatively explain the observed MI attack performance achieved for attack variants. We refute several prior non-quantitative hypotheses that over-simplify or over-estimate the influence of underlying causes, thereby failing to capture the complex interplay between several factors. Our causal models also show a new connection between generalization and MI attacks via their…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
