Towards Measuring the Representation of Subjective Global Opinions in   Language Models

Esin Durmus; Karina Nguyen; Thomas I. Liao; Nicholas Schiefer; Amanda; Askell; Anton Bakhtin; Carol Chen; Zac Hatfield-Dodds; Danny Hernandez,; Nicholas Joseph; Liane Lovitt; Sam McCandlish; Orowa Sikder; Alex Tamkin,; Janel Thamkul; Jared Kaplan; Jack Clark; Deep Ganguli

arXiv:2306.16388·cs.CL·April 15, 2024·45 cites

Towards Measuring the Representation of Subjective Global Opinions in Language Models

Esin Durmus, Karina Nguyen, Thomas I. Liao, Nicholas Schiefer, Amanda, Askell, Anton Bakhtin, Carol Chen, Zac Hatfield-Dodds, Danny Hernandez,, Nicholas Joseph, Liane Lovitt, Sam McCandlish, Orowa Sikder, Alex Tamkin,, Janel Thamkul, Jared Kaplan, Jack Clark, Deep Ganguli

PDF

Open Access 1 Repo 1 Datasets

TL;DR

This paper introduces a framework and dataset to evaluate how well large language models represent diverse global opinions, revealing biases and the influence of prompts and language translation.

Contribution

It develops a novel quantitative method and dataset for assessing the alignment of LLM responses with global human opinions, highlighting biases and cultural stereotypes.

Findings

01

LLMs tend to reflect opinions of certain countries like the USA and Europe.

02

Prompting the model with a specific country's perspective shifts responses accordingly.

03

Translation of questions does not necessarily align responses with the target language speakers' opinions.

Abstract

Large language models (LLMs) may not equitably represent diverse global perspectives on societal issues. In this paper, we develop a quantitative framework to evaluate whose opinions model-generated responses are more similar to. We first build a dataset, GlobalOpinionQA, comprised of questions and answers from cross-national surveys designed to capture diverse opinions on global issues across different countries. Next, we define a metric that quantifies the similarity between LLM-generated survey responses and human responses, conditioned on country. With our framework, we run three experiments on an LLM trained to be helpful, honest, and harmless with Constitutional AI. By default, LLM responses tend to be more similar to the opinions of certain populations, such as those from the USA, and some European and South American countries, highlighting the potential for biases. When we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

salt-nlp/culturebank
pytorch

Datasets

Anthropic/llm_global_opinions
dataset· 751 dl
751 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Computational and Text Analysis Methods · Topic Modeling

Methods7 Fastest Ways to Call American Airlines Reservations Number (USA Guide)