Identifying and Mitigating Gender Cues in Academic Recommendation Letters: An Interpretability Case Study

Charlotte S. Alexander; Shane Storks; Souradip Pal; Sayak Chakrabarty; Arushi Sharma; Mlen-Too Wesley; Bailey Russo

arXiv:2604.12337·cs.LG·April 15, 2026

Identifying and Mitigating Gender Cues in Academic Recommendation Letters: An Interpretability Case Study

Charlotte S. Alexander, Shane Storks, Souradip Pal, Sayak Chakrabarty, Arushi Sharma, Mlen-Too Wesley, Bailey Russo

PDF

TL;DR

This study investigates gender bias in academic recommendation letters using transformer models, revealing persistent gender cues even after de-gendering and emphasizing the need for auditing evaluative texts for fairness.

Contribution

The paper demonstrates the presence of implicit gender cues in recommendation letters and evaluates methods to mitigate gender bias, providing a framework for fairer evaluation practices.

Findings

01

Gender cues in LoRs can be detected with up to 68% accuracy.

02

Removing gender cues reduces classifier accuracy by up to 5.5%.

03

Gender prediction remains above chance even after de-gendering.

Abstract

Letters of recommendation (LoRs) can carry patterns of implicitly gendered language that can inadvertently influence downstream decisions, e.g. in hiring and admissions. In this work, we investigate the extent to which Transformer-based encoder models as well as Large Language Models (LLMs) can infer the gender of applicants in academic LoRs submitted to an U.S. medical-residency program after explicit identifiers like names and pronouns are de-gendered. While using three models (DistilBERT, RoBERTa, and Llama 2) to classify the gender of anonymized and de-gendered LoRs, significant gender leakage was observed as evident from up to 68% classification accuracy. Text interpretation methods, like TF-IDF and SHAP, demonstrate that certain linguistic patterns are strong proxies for gender, e.g. "emotional'' and "humanitarian'' are commonly associated with LoRs from female applicants. As an…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.