Causal Unlearning in Collaborative Optimization: Exact and Approximate Influence Reversal under Adversarial Contributions
Ali Mahdavi, Azadeh Zamanifar, Amirfarhad Farhadi, Omid Kashefi

TL;DR
The paper introduces HF-KCU, an efficient influence reversal method for federated learning that supports data deletion with minimal retraining, robustness against adversarial perturbations, and interpretability.
Contribution
HF-KCU provides a scalable, influence-based approach for data deletion in federated learning, with theoretical guarantees and practical validation across multiple architectures and datasets.
Findings
Achieves 47.75x speedup over retraining on CIFAR-10
Maintains test accuracy within 0.60% of baseline after influence reversal
Effectively resists membership inference attacks on the forget set
Abstract
Federated learning systems must support data deletion requests to comply with privacy regulations, yet retraining from scratch after each deletion is computationally prohibitive. We present HF-KCU, a method that removes a client's contribution by approximating the influence function through conjugate gradient iterations in Krylov subspaces, reducing complexity from O(d^3) to O(kd) where k<<d.A causal weighting mechanism ensures that only clients holding the deleted data receive parameter updates, preventing spurious changes to unaffected clients. Our method is designed to handle bounded adversarial perturbations to the Hessian and gradient, providing graceful degradation under realistic threat models. We validate HF-KCU across convolutional (ResNet-18, SimpleCNN) and transformer (ViT-Lite) architectures on CIFAR-10, MNIST, and Fashion-MNIST. On CIFAR-10 under Dirichlet (alpha=0.5)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
