TL;DR
This paper introduces counterfactual evaluation metrics for risk assessment tools used in high-stakes decisions, addressing biases from historical data and improving fairness assessments.
Contribution
It proposes new counterfactual fairness metrics, a doubly robust estimation method, and analyzes the limitations of standard fairness measures in decision-making contexts.
Findings
Counterfactual fairness metrics differ from standard metrics under certain conditions.
Standard fairness methods may worsen imbalance in counterfactual fairness.
Empirical results show improved fairness assessment on synthetic and real data.
Abstract
Algorithmic risk assessments are increasingly used to help humans make decisions in high-stakes settings, such as medicine, criminal justice and education. In each of these cases, the purpose of the risk assessment tool is to inform actions, such as medical treatments or release conditions, often with the aim of reducing the likelihood of an adverse event such as hospital readmission or recidivism. Problematically, most tools are trained and evaluated on historical data in which the outcomes observed depend on the historical decision-making policy. These tools thus reflect risk under the historical policy, rather than under the different decision options that the tool is intended to inform. Even when tools are constructed to predict risk under a specific decision, they are often improperly evaluated as predictors of the target outcome. Focusing on the evaluation task, in this paper we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
