Political Compass or Spinning Arrow? Towards More Meaningful Evaluations   for Values and Opinions in Large Language Models

Paul R\"ottger; Valentin Hofmann; Valentina Pyatkin; Musashi Hinck,; Hannah Rose Kirk; Hinrich Sch\"utze; Dirk Hovy

arXiv:2402.16786·cs.CL·June 6, 2024·6 cites

Political Compass or Spinning Arrow? Towards More Meaningful Evaluations for Values and Opinions in Large Language Models

Paul R\"ottger, Valentin Hofmann, Valentina Pyatkin, Musashi Hinck,, Hannah Rose Kirk, Hinrich Sch\"utze, Dirk Hovy

PDF

Open Access 1 Repo

TL;DR

This paper critiques current evaluation methods for LLMs' values and opinions, advocating for more realistic, unconstrained assessments exemplified by the Political Compass Test, revealing significant variability in model responses.

Contribution

It introduces a more realistic evaluation framework for LLMs' values, highlighting limitations of current multiple-choice methods and proposing open-ended assessments.

Findings

01

Models respond differently when not forced into multiple-choice format.

02

Answers vary based on how models are prompted.

03

Open-ended responses show different answer patterns.

Abstract

Much recent work seeks to evaluate values and opinions in large language models (LLMs) using multiple-choice surveys and questionnaires. Most of this work is motivated by concerns around real-world LLM applications. For example, politically-biased LLMs may subtly influence society when they are used by millions of people. Such real-world concerns, however, stand in stark contrast to the artificiality of current evaluations: real users do not typically ask LLMs survey questions. Motivated by this discrepancy, we challenge the prevailing constrained evaluation paradigm for values and opinions in LLMs and explore more realistic unconstrained evaluations. As a case study, we focus on the popular Political Compass Test (PCT). In a systematic review, we find that most prior work using the PCT forces models to comply with the PCT's multiple-choice format. We show that models give substantively…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

paul-rottger/llm-values-pct
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling

MethodsFocus · Perceptual control theoretic architecture