Uncovering Latent Biases in Text: Method and Application to Peer Review
Emaad Manzoor, Nihar B. Shah

TL;DR
This paper introduces a novel framework to quantify and causally link biases in text to subgroup visibility, demonstrated through analyzing peer review texts before and after double-blind policy implementation.
Contribution
The work develops a nonparametric estimation method and an identification strategy to detect biases in text caused by subgroup visibility, validated on real peer review data.
Findings
Biases in peer review texts were detected without access to review ratings.
The framework accurately identified biases before and after double-blind policy change.
Evidence of biases in review ratings was corroborated by textual analysis.
Abstract
Quantifying systematic disparities in numerical quantities such as employment rates and wages between population subgroups provides compelling evidence for the existence of societal biases. However, biases in the text written for members of different subgroups (such as in recommendation letters for male and non-male candidates), though widely reported anecdotally, remain challenging to quantify. In this work, we introduce a novel framework to quantify bias in text caused by the visibility of subgroup membership indicators. We develop a nonparametric estimation and inference procedure to estimate this bias. We then formalize an identification strategy to causally link the estimated bias to the visibility of subgroup membership indicators, provided observations from time periods both before and after an identity-hiding policy change. We identify an application wherein "ground truth" bias…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsComputational and Text Analysis Methods
