Benchmarking Distributional Alignment of Large Language Models

Nicole Meister; Carlos Guestrin; Tatsunori Hashimoto

arXiv:2411.05403·cs.CL·November 11, 2024

Benchmarking Distributional Alignment of Large Language Models

Nicole Meister, Carlos Guestrin, Tatsunori Hashimoto

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a benchmark to evaluate how well large language models can align their generated opinions with specific demographic groups, addressing a complex aspect of model simulation fidelity.

Contribution

It creates a new dataset and evaluation framework that consider question domain, steering, and distribution expression, advancing understanding of distributional alignment in LMs.

Findings

01

LLMs can more accurately describe opinion distributions than simulate them.

02

Open problems remain on whether and how LMs can truly simulate human opinions.

03

The benchmark highlights the complexity of achieving distributional alignment.

Abstract

Language models (LMs) are increasingly used as simulacra for people, yet their ability to match the distribution of views of a specific demographic group and be \textit{distributionally aligned} remains uncertain. This notion of distributional alignment is complex, as there is significant variation in the types of attributes that are simulated. Prior works have underexplored the role of three critical variables -- the question domain, steering method, and distribution expression method -- which motivates our contribution of a benchmark explicitly addressing these dimensions. We construct a dataset expanding beyond political values, create human baselines for this task, and evaluate the extent to which an LM can align with a particular group's opinion distribution to inform design choices of such simulation systems. Our analysis reveals open problems regarding if, and how, LMs can be…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

nicolemeister/benchmarking-distributional-alignment
noneOfficial

Videos

Benchmarking Distributional Alignment of Large Language Models· underline

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques

MethodsALIGN