Psychometric Alignment: Capturing Human Knowledge Distributions via Language Models
Joy He-Yueya, Wanjing Anya Ma, Kanishk Gandhi, Benjamin W. Domingue,, Emma Brunskill, Noah D. Goodman

TL;DR
This paper introduces 'psychometric alignment,' a new metric to evaluate how well language models replicate human knowledge distributions, revealing significant misalignments that can be improved with persona prompts and targeted training.
Contribution
The paper proposes a novel metric for assessing LM-human knowledge distribution alignment and demonstrates its effectiveness across multiple domains and model sizes.
Findings
Smaller LMs often align better with human knowledge distributions.
Persona-based prompts improve alignment between LMs and humans.
Training on human response data enhances LM alignment, with domain-dependent effectiveness.
Abstract
Language models (LMs) are increasingly used to simulate human-like responses in scenarios where accurately mimicking a population's behavior can guide decision-making, such as in developing educational materials and designing public policies. The objective of these simulations is for LMs to capture the variations in human responses, rather than merely providing the expected correct answers. Prior work has shown that LMs often generate unrealistically accurate responses, but there are no established metrics to quantify how closely the knowledge distribution of LMs aligns with that of humans. To address this, we introduce "psychometric alignment," a metric that measures the extent to which LMs reflect human knowledge distribution. Assessing this alignment involves collecting responses from both LMs and humans to the same set of test items and using Item Response Theory to analyze the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIntelligent Tutoring Systems and Adaptive Learning · Topic Modeling · Online Learning and Analytics
MethodsSparse Evolutionary Training
