Data Management for Causal Algorithmic Fairness

Babak Salimi; Bill Howe; Dan Suciu

arXiv:1908.07924 · cs.DB · October 2, 2019

TL;DR

This paper argues that unfairness in machine learning often originates in the training data, making it a data management problem, and that addressing it requires causal rather than associational reasoning.

Contribution

The paper distinguishes associational from causal definitions of fairness, argues that fairness requires causal reasoning, and reviews data management techniques applicable to causal fairness.

Findings

1. Causal fairness requires understanding data generation processes.

2. Existing data management techniques can be adapted for causal fairness.

3. Future research opportunities in causal data management are identified.

Abstract

Fairness is increasingly recognized as a critical component of machine learning systems. However, it is the underlying data on which these systems are trained that often reflects discrimination, suggesting a data management problem. In this paper, we first make a distinction between associational and causal definitions of fairness in the literature and argue that the concept of fairness requires causal reasoning. We then review existing works and identify future opportunities for applying data management techniques to causal algorithmic fairness.
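
To make the associational/causal distinction in the abstract concrete, the following is a minimal sketch on hypothetical toy data; it is not taken from the paper. The counts, the department and gender labels, and the helper names `admission_rate` and `parity_gap` are all assumptions made for illustration. It computes an associational measure (the difference in admission rates between two groups) over the whole population and then within each department.

```python
# Hypothetical toy counts, invented for illustration only:
# (department, gender) -> (applicants, admitted)
COUNTS = {
    ("A", "M"): (80, 48),
    ("A", "F"): (20, 12),
    ("B", "M"): (20, 4),
    ("B", "F"): (80, 16),
}


def admission_rate(counts, gender, department=None):
    """Fraction admitted for one gender, optionally within one department."""
    applicants = 0
    admitted = 0
    for (dept, g), (n, k) in counts.items():
        if g == gender and (department is None or dept == department):
            applicants += n
            admitted += k
    return admitted / applicants


def parity_gap(counts, department=None):
    """Associational measure: rate for 'M' minus rate for 'F'."""
    return (admission_rate(counts, "M", department)
            - admission_rate(counts, "F", department))


# Gap over the whole population (a purely associational quantity).
overall_gap = parity_gap(COUNTS)

# Gap after conditioning on department.
per_dept_gap = {dept: parity_gap(COUNTS, dept) for dept in ("A", "B")}

print(f"overall gap: {overall_gap:.2f}")
for dept, gap in per_dept_gap.items():
    print(f"gap within department {dept}: {gap:.2f}")
```

In this toy data the overall gap is nonzero while the gap inside each department is zero. Which of the two numbers is relevant to fairness cannot be read off the data alone: it depends on assumptions about how department choice and admission are generated, which is the kind of causal reasoning the abstract argues for.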

Taxonomy

Topics: Ethics and Social Impacts of AI · Explainable Artificial Intelligence (XAI) · Privacy-Preserving Technologies in Data