CoMPosT: Characterizing and Evaluating Caricature in LLM Simulations

Myra Cheng; Tiziano Piccardi; Diyi Yang

arXiv:2310.11501·cs.CL·October 19, 2023·2 cites

CoMPosT: Characterizing and Evaluating Caricature in LLM Simulations

Myra Cheng, Tiziano Piccardi, Diyi Yang

PDF

Open Access 1 Repo

TL;DR

This paper introduces CoMPosT, a framework for evaluating the quality of LLM simulations of human personas, focusing on caricature susceptibility, and finds that certain demographics and topics are prone to exaggerated stereotypes.

Contribution

The paper presents CoMPosT, a novel multidimensional framework for characterizing and assessing caricature in LLM-generated human simulations.

Findings

01

GPT-4 simulations of political and marginalized groups show high caricature susceptibility.

02

Certain topics lead to more exaggerated stereotypes in LLM simulations.

03

The framework enables systematic evaluation of simulation quality and bias.

Abstract

Recent work has aimed to capture nuances of human behavior by using LLMs to simulate responses from particular demographics in settings like social science experiments and public opinion surveys. However, there are currently no established ways to discuss or evaluate the quality of such LLM simulations. Moreover, there is growing concern that these LLM simulations are flattened caricatures of the personas that they aim to simulate, failing to capture the multidimensionality of people and perpetuating stereotypes. To bridge these gaps, we present CoMPosT, a framework to characterize LLM simulations using four dimensions: Context, Model, Persona, and Topic. We use this framework to measure open-ended LLM simulations' susceptibility to caricature, defined via two criteria: individuation and exaggeration. We evaluate the level of caricature in scenarios from existing work on LLM…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

myracheng/lm_caricature
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputational and Text Analysis Methods · Opinion Dynamics and Social Influence · Social Media and Politics

MethodsMulti-Head Attention · Attention Is All You Need · Dense Connections · Linear Layer · Residual Connection · Absolute Position Encodings · Layer Normalization · Softmax · Adam · Byte Pair Encoding