Psychometric Alignment: Capturing Human Knowledge Distributions via   Language Models

Joy He-Yueya; Wanjing Anya Ma; Kanishk Gandhi; Benjamin W. Domingue,; Emma Brunskill; Noah D. Goodman

arXiv:2407.15645·cs.CL·July 23, 2024·1 cites

Psychometric Alignment: Capturing Human Knowledge Distributions via Language Models

Joy He-Yueya, Wanjing Anya Ma, Kanishk Gandhi, Benjamin W. Domingue,, Emma Brunskill, Noah D. Goodman

PDF

Open Access 1 Repo

TL;DR

This paper introduces 'psychometric alignment,' a new metric to evaluate how well language models replicate human knowledge distributions, revealing significant misalignments that can be improved with persona prompts and targeted training.

Contribution

The paper proposes a novel metric for assessing LM-human knowledge distribution alignment and demonstrates its effectiveness across multiple domains and model sizes.

Findings

01

Smaller LMs often align better with human knowledge distributions.

02

Persona-based prompts improve alignment between LMs and humans.

03

Training on human response data enhances LM alignment, with domain-dependent effectiveness.

Abstract

Language models (LMs) are increasingly used to simulate human-like responses in scenarios where accurately mimicking a population's behavior can guide decision-making, such as in developing educational materials and designing public policies. The objective of these simulations is for LMs to capture the variations in human responses, rather than merely providing the expected correct answers. Prior work has shown that LMs often generate unrealistically accurate responses, but there are no established metrics to quantify how closely the knowledge distribution of LMs aligns with that of humans. To address this, we introduce "psychometric alignment," a metric that measures the extent to which LMs reflect human knowledge distribution. Assessing this alignment involves collecting responses from both LMs and humans to the same set of test items and using Item Response Theory to analyze the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

joyheyueya/psychometric-alignment
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIntelligent Tutoring Systems and Adaptive Learning · Topic Modeling · Online Learning and Analytics

MethodsSparse Evolutionary Training