Debiasing Machine Unlearning with Counterfactual Examples
Ziheng Chen, Jia Wang, Jun Zhuang, Abbavaram Gowtham Reddy, Fabrizio, Silvestri, Jin Huang, Kaushiki Nag, Kun Kuang, Xin Ning, Gabriele Tolomei

TL;DR
This paper addresses bias issues in machine unlearning for the right to be forgotten, proposing a causal, intervention-based method using counterfactual examples to improve unlearning effectiveness and fairness.
Contribution
It introduces a novel approach combining causal analysis and counterfactual examples to mitigate bias in machine unlearning processes.
Findings
Outperforms existing unlearning baselines on evaluation metrics
Effectively reduces data-level and algorithm-level biases
Maintains semantic data consistency during unlearning
Abstract
The right to be forgotten (RTBF) seeks to safeguard individuals from the enduring effects of their historical actions by implementing machine-learning techniques. These techniques facilitate the deletion of previously acquired knowledge without requiring extensive model retraining. However, they often overlook a critical issue: unlearning processes bias. This bias emerges from two main sources: (1) data-level bias, characterized by uneven data removal, and (2) algorithm-level bias, which leads to the contamination of the remaining dataset, thereby degrading model accuracy. In this work, we analyze the causal factors behind the unlearning process and mitigate biases at both data and algorithmic levels. Typically, we introduce an intervention-based approach, where knowledge to forget is erased with a debiased dataset. Besides, we guide the forgetting procedure by leveraging counterfactual…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Industrial Vision Systems and Defect Detection · Machine Learning and Algorithms
