Language Models Change Facts Based on the Way You Talk
Matthew Kearney, Reuben Binns, Yarin Gal

TL;DR
This paper reveals that large language models (LLMs) are highly sensitive to user identity markers like race, gender, and age, which biases their responses across critical domains such as medicine, law, and employment, potentially causing harm.
Contribution
It provides the first comprehensive analysis of how identity cues in user language influence LLM responses in high-stakes applications and introduces new evaluation tools for this bias.
Findings
LLMs alter medical advice based on ethnicity.
Responses shift with political worldview related to user age.
Salary recommendations vary by user race and gender.
Abstract
Large language models (LLMs) are increasingly being used in user-facing applications, from providing medical consultations to job interview advice. Recent research suggests that these models are becoming increasingly proficient at inferring identity information about the author of a piece of text from linguistic patterns as subtle as the choice of a few words. However, little is known about how LLMs use this information in their decision-making in real-world applications. We perform the first comprehensive analysis of how identity markers present in a user's writing bias LLM responses across five different high-stakes LLM applications in the domains of medicine, law, politics, government benefits, and job salaries. We find that LLMs are extremely sensitive to markers of identity in user queries and that race, gender, and age consistently influence LLM responses in these applications.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
