Reading Between the Prompts: How Stereotypes Shape LLM's Implicit Personalization

Vera Neplenbroek; Arianna Bisazza; Raquel Fernández

arXiv:2505.16467 · cs.CL · September 17, 2025

TL;DR

This paper investigates how large language models infer users' demographic attributes from stereotypical cues in conversations. It shows that these inferences can be biased and proposes a way to mitigate stereotype-driven implicit personalization, with the aim of improving transparency and control.

Contribution

The paper systematically analyzes how LLMs respond to stereotypical cues, demonstrates that the resulting demographic inferences persist, and introduces an intervention method to reduce this bias.

Findings

01. LLMs infer demographic attributes from stereotypical cues.

02. Stereotype-driven inferences persist even when users specify different identities.

03. Intervening on internal representations can mitigate stereotype-based biases.
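To make the third finding more concrete, here is a minimal sketch of what reading and shifting a demographic signal in a hidden representation could look like. It is not the paper's method, which this page does not describe. As a stand-in for a trained linear probe it uses a simple difference-of-means direction, and every name (`fit_direction`, `probe_score`, `steer`, `alpha`) and every vector is a made-up toy assumption.

```python
from typing import List, Sequence

Vector = List[float]


def mean_vector(rows: Sequence[Sequence[float]]) -> Vector:
    """Column-wise mean of a list of equal-length vectors."""
    n = len(rows)
    return [sum(col) / n for col in zip(*rows)]


def dot(a: Sequence[float], b: Sequence[float]) -> float:
    return sum(x * y for x, y in zip(a, b))


def fit_direction(group_a: Sequence[Sequence[float]],
                  group_b: Sequence[Sequence[float]]) -> Vector:
    """Unit vector pointing from the mean of group B to the mean of group A.

    Assumption: a difference-of-means direction replaces a trained probe.
    """
    mu_a, mu_b = mean_vector(group_a), mean_vector(group_b)
    diff = [x - y for x, y in zip(mu_a, mu_b)]
    norm = dot(diff, diff) ** 0.5
    return [d / norm for d in diff]


def probe_score(hidden: Sequence[float], direction: Sequence[float],
                midpoint: float) -> float:
    """Positive means the representation looks like group A, negative like B."""
    return dot(hidden, direction) - midpoint


def steer(hidden: Sequence[float], direction: Sequence[float],
          alpha: float) -> Vector:
    """Shift a hidden vector by alpha along the probe direction."""
    return [h + alpha * d for h, d in zip(hidden, direction)]


# Toy 2-d "hidden states" for two hypothetical user groups (invented data).
group_a = [[2.0, 0.0], [4.0, 0.0]]
group_b = [[-2.0, 0.0], [-4.0, 0.0]]

direction = fit_direction(group_a, group_b)
# Decision threshold: projection of the point halfway between the group means.
midpoint = dot(mean_vector([mean_vector(group_a), mean_vector(group_b)]), direction)

# A representation that currently reads as group B ...
hidden = [-1.0, 0.5]
before = probe_score(hidden, direction, midpoint)

# ... is pushed toward group A, e.g. the identity the user stated explicitly.
steered = steer(hidden, direction, alpha=3.0)
after = probe_score(steered, direction, midpoint)
```

In a real model, the vectors would be activations taken from a chosen layer, and the shift would be applied during generation. Which layer to use and how large to make `alpha` are open choices that this sketch does not address.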

Abstract

Generative Large Language Models (LLMs) infer users' demographic information from subtle cues in the conversation, a phenomenon called implicit personalization. Prior work has shown that such inferences can lead to lower-quality responses for users assumed to be from minority groups, even when no demographic information is explicitly provided. In this work, we systematically explore how LLMs respond to stereotypical cues using controlled synthetic conversations, analyzing the models' latent user representations through both model internals and generated answers to targeted user questions. Our findings reveal that LLMs do infer demographic attributes based on these stereotypical signals, and that for a number of groups these inferences persist even when the user explicitly identifies with a different demographic group. Finally, we show that this form of stereotype-driven implicit personalization can…

Code & Models

Repositories

veranep/implicit-personalization-stereotypes
PyTorch · Official

Videos

Reading Between the Prompts: How Stereotypes Shape LLM’s Implicit Personalization

Taxonomy

Topics: Topic Modeling · Artificial Intelligence in Healthcare and Education · AI in Service Interactions