Knowledge Removal in Sampling-based Bayesian Inference
Shaopeng Fu, Fengxiang He, Dacheng Tao

TL;DR
This paper introduces the first machine unlearning algorithm for sampling-based Bayesian inference methods like MCMC, enabling efficient removal of learned knowledge while maintaining model generalizability.
Contribution
It converts the MCMC unlearning problem into an explicit optimization task and designs an influence function to characterize learned knowledge, which is novel.
Findings
The algorithm effectively unlearns data in Gaussian mixture models.
It preserves the generalizability of Bayesian neural networks after unlearning.
Experimental results confirm the method's effectiveness.
Abstract
The right to be forgotten has been legislated in many countries, but its enforcement in the AI industry would cause unbearable costs. When single data deletion requests come, companies may need to delete the whole models learned with massive resources. Existing works propose methods to remove knowledge learned from data for explicitly parameterized models, which however are not appliable to the sampling-based Bayesian inference, i.e., Markov chain Monte Carlo (MCMC), as MCMC can only infer implicit distributions. In this paper, we propose the first machine unlearning algorithm for MCMC. We first convert the MCMC unlearning problem into an explicit optimization problem. Based on this problem conversion, an {\it MCMC influence function} is designed to provably characterize the learned knowledge from data, which then delivers the MCMC unlearning algorithm. Theoretical analysis shows that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsBayesian Methods and Mixture Models · Gaussian Processes and Bayesian Inference · Bayesian Modeling and Causal Inference
