CoMPosT: Characterizing and Evaluating Caricature in LLM Simulations
Myra Cheng, Tiziano Piccardi, Diyi Yang

TL;DR
This paper introduces CoMPosT, a framework for evaluating the quality of LLM simulations of human personas, focusing on caricature susceptibility, and finds that certain demographics and topics are prone to exaggerated stereotypes.
Contribution
The paper presents CoMPosT, a novel multidimensional framework for characterizing and assessing caricature in LLM-generated human simulations.
Findings
GPT-4 simulations of political and marginalized groups show high caricature susceptibility.
Certain topics lead to more exaggerated stereotypes in LLM simulations.
The framework enables systematic evaluation of simulation quality and bias.
Abstract
Recent work has aimed to capture nuances of human behavior by using LLMs to simulate responses from particular demographics in settings like social science experiments and public opinion surveys. However, there are currently no established ways to discuss or evaluate the quality of such LLM simulations. Moreover, there is growing concern that these LLM simulations are flattened caricatures of the personas that they aim to simulate, failing to capture the multidimensionality of people and perpetuating stereotypes. To bridge these gaps, we present CoMPosT, a framework to characterize LLM simulations using four dimensions: Context, Model, Persona, and Topic. We use this framework to measure open-ended LLM simulations' susceptibility to caricature, defined via two criteria: individuation and exaggeration. We evaluate the level of caricature in scenarios from existing work on LLM…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational and Text Analysis Methods · Opinion Dynamics and Social Influence · Social Media and Politics
MethodsMulti-Head Attention · Attention Is All You Need · Dense Connections · Linear Layer · Residual Connection · Absolute Position Encodings · Layer Normalization · Softmax · Adam · Byte Pair Encoding
