Bias Redistribution in Visual Machine Unlearning: Does Forgetting One Group Harm Another?

Yunusa Haruna; Adamu Lawan; Ibrahim Haruna Abdulhamid; Hamza Mohammed Dauda; Jiaquan Zhang; Chaoning Zhang; Shamsuddeen Hassan Muhammad

arXiv:2604.08111·cs.LG·April 10, 2026

Bias Redistribution in Visual Machine Unlearning: Does Forgetting One Group Harm Another?

Yunusa Haruna, Adamu Lawan, Ibrahim Haruna Abdulhamid, Hamza Mohammed Dauda, Jiaquan Zhang, Chaoning Zhang, Shamsuddeen Hassan Muhammad

PDF

TL;DR

This paper examines how machine unlearning affects bias redistribution in CLIP models, revealing that forgetting one demographic group often shifts bias to others, especially along gender lines.

Contribution

It uncovers bias redistribution phenomena in CLIP models during unlearning and evaluates methods that mitigate but do not fully prevent bias transfer.

Findings

01

Unlearning redistributes bias mainly along gender boundaries.

02

Removing Young Female transfers performance to Old Female.

03

Refusal Vector reduces redistribution but degrades overall performance.

Abstract

Machine unlearning enables models to selectively forget training data, driven by privacy regulations such as GDPR and CCPA. However, its fairness implications remain underexplored: when a model forgets a demographic group, does it neutralize that concept or redistribute it to correlated groups, potentially amplifying bias? We investigate this bias redistribution phenomenon on CelebA using CLIP models (ViT/B-32, ViT-L/14, ViT-B/16) under a zero-shot classification setting across intersectional groups defined by age and gender. We evaluate three unlearning methods, Prompt Erasure, Prompt Reweighting, and Refusal Vector using per-group accuracy shifts, demographic parity gaps, and a redistribution score. Our results show that unlearning does not eliminate bias but redistributes it primarily along gender rather than age boundaries. In particular, removing the dominant Young Female group…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.