ValueDCG: Measuring Comprehensive Human Value Understanding Ability of   Language Models

Zhaowei Zhang; Fengshuo Bai; Jun Gao; Yaodong Yang

arXiv:2310.00378·cs.CL·June 18, 2024·1 cites

ValueDCG: Measuring Comprehensive Human Value Understanding Ability of Language Models

Zhaowei Zhang, Fengshuo Bai, Jun Gao, Yaodong Yang

PDF

Open Access 1 Repo

TL;DR

This paper introduces ValueDCG, a new metric to evaluate how well large language models understand human values, focusing on both factual knowledge and underlying reasons, revealing that larger models do not necessarily improve in this understanding.

Contribution

The paper proposes a comprehensive evaluation metric, ValueDCG, for assessing LLMs' understanding of human values, emphasizing the importance of both knowledge and reasoning aspects.

Findings

01

LLMs' understanding of human values does not scale with model size.

02

Larger models may generate plausible explanations without true value comprehension.

03

Evaluation reveals potential risks of LLMs misrepresenting human values.

Abstract

Personal values are a crucial factor behind human decision-making. Considering that Large Language Models (LLMs) have been shown to impact human decisions significantly, it is essential to make sure they accurately understand human values to ensure their safety. However, evaluating their grasp of these values is complex due to the value's intricate and adaptable nature. We argue that truly understanding values in LLMs requires considering both "know what" and "know why". To this end, we present a comprehensive evaluation metric, ValueDCG (Value Discriminator-Critique Gap), to quantitatively assess the two aspects with an engineering implementation. We assess four representative LLMs and provide compelling evidence that the growth rates of LLM's "know what" and "know why" capabilities do not align with increases in parameter numbers, resulting in a decline in the models' capacity to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zowiezhang/valuedcg
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Computational and Text Analysis Methods

MethodsMulti-Head Attention · Attention Is All You Need · Dropout · Dense Connections · Linear Layer · Label Smoothing · Adam · Absolute Position Encodings · Residual Connection · Layer Normalization