Can Language Models Reason about Individualistic Human Values and Preferences?

Liwei Jiang; Taylor Sorensen; Sydney Levine; Yejin Choi

arXiv:2410.03868·cs.CL·June 3, 2025

Can Language Models Reason about Individualistic Human Values and Preferences?

Liwei Jiang, Taylor Sorensen, Sydney Levine, Yejin Choi

PDF

Open Access 4 Reviews

TL;DR

This paper introduces IndieValueCatalog, a dataset for testing language models' ability to reason about individualistic human values, revealing current models' limitations and the importance of nuanced understanding beyond demographics.

Contribution

The paper presents IndieValueCatalog, a novel dataset for evaluating language models' capacity to understand individualistic values, and analyzes models' performance and biases in this context.

Findings

01

Frontier LMs achieve only 55-65% accuracy in predicting individualistic values.

02

Demographic information alone is insufficient to describe individualistic values.

03

Models show partiality in reasoning about global individualistic values, measured by the {}Inequity Index.

Abstract

Recent calls for pluralistic alignment emphasize that AI systems should address the diverse needs of all people. Yet, efforts in this space often require sorting people into fixed buckets of pre-specified diversity-defining dimensions (e.g., demographics), risking smoothing out individualistic variations or even stereotyping. To achieve an authentic representation of diversity that respects individuality, we propose individualistic alignment. While individualistic alignment can take various forms, we introduce IndieValueCatalog, a dataset transformed from the influential World Values Survey (WVS), to study language models (LMs) on the specific challenge of individualistic value reasoning. Given a sample of an individual's value-expressing statements, models are tasked with predicting this person's value judgments in novel cases. With IndieValueCatalog, we reveal critical limitations in…

Peer Reviews

Decision·ICLR 2025 Conference Withdrawn Submission

Reviewer 01Rating 6Confidence 3

Strengths

1. The proposed dataset is a significant addition, transforming unstructured WVS data into a structured, standardized resource for examining individualistic values. This dataset enables a more granular approach to evaluate the performance of human value reasoning on LLMs. 2. The paper’s critique of pluralistic alignment's reliance on broad demographic categories is thought-provoking. By shifting the focus to individualistic alignment, the authors argue for AI systems that respect individual uni

Weaknesses

1. The reliance on WVS data, while innovative, may limit the applicability of results. Survey responses may not capture the full breadth of individual values, and the transformation of survey items into value-expressing statements could introduce biases or oversimplify complex beliefs. 2. The authors lack an analysis of the task's challenges and fail to sufficiently examine the reasons behind the poor performance of LLMs. Is the subpar performance primarily due to the complexity and contradictio

Reviewer 02Rating 3Confidence 4

Strengths

1. The work tackles a crucial challenge in AI alignment: understanding human values at an individual level rather than relying on broad demographic categories. This bottom-up approach overcomes the limitations of traditional demographic-based models, enabling the development of AI systems that are both more equitable and better tailored to individual needs. 2. The paper's visualization is particularly effective. Figure 1 provides a clear illustration of the author's concept of individualistic va

Weaknesses

1. The methodology lacks novelty. The training of individualistic value reasoner relies solely on fine-tuning approaches; the proposed metrics on LM proficiency and impartiality offer no novel contributions, and the overall methodological approach contains no significant innovations. 2. The paper's analysis lacks sufficient depth and fails to make substantial contributions to the field. While it identifies a key limitation - namely, frontier LLMs' deficiency in understanding and predicting indiv

Reviewer 03Rating 6Confidence 4

Strengths

* This paper focuses on individualistic alignment, which is an interesting and novel topic * This paper utilizes World Values Survey (WVS) data and transforms it into a format suitable for LLM training, creating the INDIEVALUECATALOG dataset * This paper proposes the VALUE INEQUITY INDEX (σINEQUITY) to measure the fairness of model reasoning across different demographic groups, revealing the current limitations of SOTA LLMs in this aspect * Through fine-tuning LLMs on INDIEVALUECATALOG, authors

Weaknesses

* Despite the interesting problem setting, the technical contributions of this paper appear limited for ICLR. * There is insufficient discussion of the practical application value of "individualistic alignment" * The paper lacks performance comparisons with related work

Reviewer 04Rating 3Confidence 2

Strengths

(1) The study of prediction equity across demographic groups is interesting, and the result is insightful. (2) A new metric, the Value Inequity Index (σINEQUITY), is proposed to measure how equitably models treat different demographics. (3) This paper tested multiple LLMs.

Weaknesses

(1) The problem formalization with notations in Section 2.2 is unnecessarily complicated. (2) The focus on predicting individual values may be problematic. Even individuals with the same demographics can differ widely due to various factors. This means the data contains a lot of randomness and noises. This could be why the models struggled to perform well, even after fine-tuning on similar data. A group/demographic-level setting might be more reasonable. (3) Why the studied task is important a

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEthics and Social Impacts of AI · Computational and Text Analysis Methods · Multi-Agent Systems and Negotiation