Towards Measuring the Representation of Subjective Global Opinions in Language Models
Esin Durmus, Karina Nguyen, Thomas I. Liao, Nicholas Schiefer, Amanda, Askell, Anton Bakhtin, Carol Chen, Zac Hatfield-Dodds, Danny Hernandez,, Nicholas Joseph, Liane Lovitt, Sam McCandlish, Orowa Sikder, Alex Tamkin,, Janel Thamkul, Jared Kaplan, Jack Clark, Deep Ganguli

TL;DR
This paper introduces a framework and dataset to evaluate how well large language models represent diverse global opinions, revealing biases and the influence of prompts and language translation.
Contribution
It develops a novel quantitative method and dataset for assessing the alignment of LLM responses with global human opinions, highlighting biases and cultural stereotypes.
Findings
LLMs tend to reflect opinions of certain countries like the USA and Europe.
Prompting the model with a specific country's perspective shifts responses accordingly.
Translation of questions does not necessarily align responses with the target language speakers' opinions.
Abstract
Large language models (LLMs) may not equitably represent diverse global perspectives on societal issues. In this paper, we develop a quantitative framework to evaluate whose opinions model-generated responses are more similar to. We first build a dataset, GlobalOpinionQA, comprised of questions and answers from cross-national surveys designed to capture diverse opinions on global issues across different countries. Next, we define a metric that quantifies the similarity between LLM-generated survey responses and human responses, conditioned on country. With our framework, we run three experiments on an LLM trained to be helpful, honest, and harmless with Constitutional AI. By default, LLM responses tend to be more similar to the opinions of certain populations, such as those from the USA, and some European and South American countries, highlighting the potential for biases. When we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Computational and Text Analysis Methods · Topic Modeling
Methods7 Fastest Ways to Call American Airlines Reservations Number (USA Guide)
