Measuring Fairness of Text Classifiers via Prediction Sensitivity

Satyapriya Krishna, Rahul Gupta, Apurv Verma, Jwala Dhamala, Yada Pruksachatkun, Kai-Wei Chang

arXiv:2203.08670 · cs.LG · March 17, 2022

TL;DR

This paper introduces accumulated prediction sensitivity, a fairness metric for text classifiers that measures how much a prediction depends on protected attributes. The metric is linked theoretically to established fairness notions and correlates with human judgments of fairness.

Contribution

The paper proposes a prediction-sensitivity-based fairness metric for text classifiers, links it to group and individual fairness, and demonstrates its effectiveness through experiments.

Findings

01

The new metric correlates more strongly with human fairness judgments than existing metrics.

02

The metric is theoretically linked to statistical parity (a notion of group fairness) and to individual fairness.

03

Experimental results on toxicity and bias datasets validate the metric's effectiveness.

Abstract

With the rapid growth in language processing applications, fairness has emerged as an important consideration in data-driven solutions. Although various fairness definitions have been explored in the recent literature, there is a lack of consensus on which metrics most accurately reflect the fairness of a system. In this work, we propose a new formulation, accumulated prediction sensitivity, which measures fairness in machine learning models based on the model's prediction sensitivity to perturbations in input features. The metric attempts to quantify the extent to which a single prediction depends on a protected attribute, where the protected attribute encodes the membership status of an individual in a protected group. We show that the metric can be theoretically linked with a specific notion of group fairness (statistical parity) and individual fairness. It also correlates well with…
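The abstract describes the metric only in words, so the following is a minimal sketch of the general idea and not the paper's exact formulation. It assumes a toy logistic classifier, estimates the sensitivity of the prediction to each input feature by finite differences, and accumulates those sensitivities with a hypothetical weight vector `feature_weights` that marks which feature encodes the protected attribute. All function names, weights, and inputs here are illustrative assumptions.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def make_linear_classifier(weights, bias):
    """Toy stand-in for a text classifier: logistic model over numeric features."""
    def predict(x):
        return sigmoid(sum(w * xi for w, xi in zip(weights, x)) + bias)
    return predict


def prediction_sensitivity(predict, x, eps=1e-5):
    """Estimate |d predict / d x_i| for each feature by central differences."""
    sensitivities = []
    for i in range(len(x)):
        hi = list(x)
        lo = list(x)
        hi[i] += eps
        lo[i] -= eps
        sensitivities.append(abs(predict(hi) - predict(lo)) / (2 * eps))
    return sensitivities


def accumulated_sensitivity(predict, x, feature_weights, eps=1e-5):
    """Weighted sum of per-feature sensitivities (assumed accumulation scheme)."""
    sens = prediction_sensitivity(predict, x, eps)
    return sum(v * s for v, s in zip(feature_weights, sens))


# Hypothetical setup: feature index 2 encodes the protected attribute.
x = [0.5, -1.0, 1.0]
feature_weights = [0.0, 0.0, 1.0]

# One model relies on the protected feature, the other ignores it.
biased = make_linear_classifier([0.8, -0.3, 1.5], 0.0)
neutral = make_linear_classifier([0.8, -0.3, 0.0], 0.0)

score_biased = accumulated_sensitivity(biased, x, feature_weights)
score_neutral = accumulated_sensitivity(neutral, x, feature_weights)
print(score_biased, score_neutral)
```

In this sketch a larger score means the prediction for `x` moves more when the protected feature is perturbed, so the model that ignores that feature scores zero. For a real text classifier the perturbation would typically be applied to learned input representations rather than hand-built numeric features.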

Taxonomy

Methods: Jigsaw