Using AI for User Representation: An Analysis of 83 Persona Prompts

Joni Salminen; Danial Amin; Bernard Jansen

arXiv:2508.13047·cs.HC·March 4, 2026

Using AI for User Representation: An Analysis of 83 Persona Prompts

Joni Salminen, Danial Amin, Bernard Jansen

PDF

TL;DR

This paper analyzes 83 prompts used with large language models to generate user personas, revealing trends, common formats, and methodological practices in computational user representation.

Contribution

It provides a comprehensive analysis of persona prompts, highlighting prevalent formats, attributes, and research practices, and discusses implications for user modeling.

Findings

01

Most prompts generate single, concise personas.

02

Text and numbers are the main formats for persona attributes.

03

Structured formats like JSON are frequently required.

Abstract

We analyzed 83 persona prompts from 27 research articles that used large language models (LLMs) to generate user personas. Findings show that the prompts predominantly generate single personas. Several prompts express a desire for short or concise persona descriptions, which deviates from the tradition of creating rich, informative, and rounded persona profiles. Text is the most common format for generated persona attributes, followed by numbers. Text and numbers are often generated together, and demographic attributes are included in nearly all generated personas. Researchers use up to 12 prompts in a single study, though most research uses a small number of prompts. Comparison and testing multiple LLMs is rare. More than half of the prompts require the persona output in a structured format, such as JSON, and 74% of the prompts insert data or dynamic variables. We discuss the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.