Target specification bias, counterfactual prediction, and algorithmic   fairness in healthcare

Eran Tal

arXiv:2308.02081·cs.LG·August 7, 2023

Target specification bias, counterfactual prediction, and algorithmic fairness in healthcare

Eran Tal

PDF

TL;DR

This paper highlights target specification bias as a key, often overlooked source of bias in healthcare machine learning, emphasizing its impact on prediction accuracy and decision-making, and proposes solutions inspired by measurement science.

Contribution

It identifies target specification bias as a pervasive issue in healthcare ML and suggests methods from metrology to mitigate its effects.

Findings

01

Target specification bias causes overestimated accuracy and suboptimal decisions.

02

Addressing this bias improves resource utilization and patient outcomes.

03

Proposed solutions draw from measurement science to correct bias.

Abstract

Bias in applications of machine learning (ML) to healthcare is usually attributed to unrepresentative or incomplete data, or to underlying health disparities. This article identifies a more pervasive source of bias that affects the clinical utility of ML-enabled prediction tools: target specification bias. Target specification bias arises when the operationalization of the target variable does not match its definition by decision makers. The mismatch is often subtle, and stems from the fact that decision makers are typically interested in predicting the outcomes of counterfactual, rather than actual, healthcare scenarios. Target specification bias persists independently of data limitations and health disparities. When left uncorrected, it gives rise to an overestimation of predictive accuracy, to inefficient utilization of medical resources, and to suboptimal decisions that can harm…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.