Value Portrait: Assessing Language Models' Values through Psychometrically and Ecologically Valid Items
Jongwook Han, Dongmin Choi, Woojung Song, Eun-Ju Lee, Yohan Jo

TL;DR
This paper introduces the Value Portrait benchmark, a psychometrically validated framework for assessing language models' values based on real-life interactions and human ratings, revealing models' value priorities and biases.
Contribution
The paper presents a novel benchmark that evaluates LLMs' values using ecologically valid items rated by humans, improving assessment relevance and reliability.
Findings
LLMs prioritize Benevolence, Security, and Self-Direction.
Models show biases in perceiving demographic groups.
Benchmark correlates well with human value scores.
Abstract
The importance of benchmarks for assessing the values of language models has been pronounced due to the growing need of more authentic, human-aligned responses. However, existing benchmarks rely on human or machine annotations that are vulnerable to value-related biases. Furthermore, the tested scenarios often diverge from real-world contexts in which models are commonly used to generate text and express values. To address these issues, we propose the Value Portrait benchmark, a reliable framework for evaluating LLMs' value orientations with two key characteristics. First, the benchmark consists of items that capture real-life user-LLM interactions, enhancing the relevance of assessment results to real-world LLM usage. Second, each item is rated by human subjects based on its similarity to their own thoughts, and correlations between these ratings and the subjects' actual value scores…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Ethics and Social Impacts of AI · AI in Service Interactions
