Evaluating AI Alignment in LLMs: Output Analysis of Value Priorities Across 75 Models with Human Benchmarking

Gabriel Rongyang Lau; Wei Yan Low; Seow Min Koh; Fiona Fui-Hoon Nah; Andree Hartanto

arXiv:2506.12617·cs.AI·May 19, 2026

Evaluating AI Alignment in LLMs: Output Analysis of Value Priorities Across 75 Models with Human Benchmarking

Gabriel Rongyang Lau, Wei Yan Low, Seow Min Koh, Fiona Fui-Hoon Nah, Andree Hartanto

PDF

TL;DR

This paper introduces a novel output-based method to evaluate AI alignment in LLMs by comparing their expressed value priorities with human judgments across 75 models, revealing both similarities and systematic differences.

Contribution

It develops a scalable profile-fidelity metric for auditing LLMs' value alignment with humans, highlighting the stability and variability of their value-priority structures.

Findings

01

LLM outputs show high stability within models and similar value-priority structures across models.

02

Most models replicate human value orderings but often exaggerate differences between priorities.

03

Profile fidelity varies widely and does not correlate strongly with model size or recency.

Abstract

Large language models (LLMs) are increasingly used in human-AI interaction research and practice, yet existing capability and safety benchmarks reveal little about the value priorities these systems express or how those priorities correspond to human judgements. Across three studies, we introduce an output-based approach to evaluating one facet of AI alignment by treating LLM-generated text as behavioural data and comparing expressed value-priority profiles with a human reference. Study 1 used inductive qualitative analysis to derive six themes of optimal AI functioning, namely Performance, Adaptive Capacity, Social Good, Ethics and Responsibility, Relational Integration, and Agency. Study 2 showed that LLM outputs were highly stable within models and converged on a common value-priority structure across models, indicating reliable and comparable value profiles. Study 3 benchmarked 75…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBig Data and Business Intelligence · Digital Transformation in Industry