Different Demographic Cues Yield Inconsistent Conclusions About LLM Personalization and Bias

Manuel Tonneau; Neil K. R. Seghal; Niyati Malhotra; Sharif Kazemi; Victor Orozco-Olvera; Ana Mar\'ia Mu\~noz Boudet; Lakshmi Subramanian; Samuel P. Fraiberger; Sharath Chandra Guntuku; Valentin Hofmann

arXiv:2601.18486·cs.CL·March 24, 2026

Different Demographic Cues Yield Inconsistent Conclusions About LLM Personalization and Bias

Manuel Tonneau, Neil K. R. Seghal, Niyati Malhotra, Sharif Kazemi, Victor Orozco-Olvera, Ana Mar\'ia Mu\~noz Boudet, Lakshmi Subramanian, Samuel P. Fraiberger, Sharath Chandra Guntuku, Valentin Hofmann

PDF

Open Access

TL;DR

This study reveals that using different demographic cues like names or other signals leads to inconsistent conclusions about LLM personalization and bias, emphasizing the importance of multi-cue evaluation for reliable insights.

Contribution

It demonstrates that demographic cues are not interchangeable and that their differences significantly impact assessments of LLM responses, advocating for multi-cue, mechanism-aware evaluation methods.

Findings

01

Cues for the same demographic group produce only partially overlapping response changes.

02

Bias assessments vary in magnitude and direction depending on the cue used.

03

Inconsistencies are due to differences in cue-group association strength and linguistic features.

Abstract

Demographic cue-based evaluation is widely used to study how large language models (LLMs) adapt their responses to signaled demographic attributes within and across groups. This approach typically relies on a single cue (e.g., names) as a proxy for group membership, implicitly treating different cues as interchangeable operationalizations of the same identity-conditioned behavior. We test this assumption in realistic advice-seeking interactions spanning 14.8 million prompts, focusing on race and gender in a U.S. context. We find that cues for the same group induce only partially overlapping changes in model responses, yielding inconsistent conclusions about personalization, while bias conclusions are unstable, with both magnitude and direction of group differences varying across cues. We further show that these inconsistencies reflect differences in cue-group association strength and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsLanguage and cultural evolution · Topic Modeling · Computational and Text Analysis Methods