Transfer of Machine Learning Fairness across Domains
Candice Schumann, Xuezhi Wang, Alex Beutel, Jilin Chen, Hai Qian, Ed, H. Chi

TL;DR
This paper explores how to transfer fairness in machine learning models across different domains, providing theoretical guarantees and practical methods to improve fairness in data-sparse target domains.
Contribution
It introduces a theoretical framework and modeling approach for transferring fairness across domains, addressing challenges of limited data and domain shifts.
Findings
Theoretical guarantees for fairness transfer across domains.
Empirical validation showing improved fairness metrics.
Effective in data-sparse target domains.
Abstract
If our models are used in new or unexpected cases, do we know if they will make fair predictions? Previously, researchers developed ways to debias a model for a single problem domain. However, this is often not how models are trained and used in practice. For example, labels and demographics (sensitive attributes) are often hard to observe, resulting in auxiliary or synthetic data to be used for training, and proxies of the sensitive attribute to be used for evaluation of fairness. A model trained for one setting may be picked up and used in many others, particularly as is common with pre-training and cloud APIs. Despite the pervasiveness of these complexities, remarkably little work in the fairness literature has theoretically examined these issues. We frame all of these settings as domain adaptation problems: how can we use what we have learned in a source domain to debias in a new…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEthics and Social Impacts of AI · Explainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning
