Beyond Mimicry: Preference Coherence in LLMs
Luhan Mikaelson, Derek Shiller, Hayley Clatterbuck

TL;DR
This paper examines whether large language models have genuine, coherent preferences by testing their responses to various trade-offs, revealing that most lack stable, unified decision-making structures and raise concerns for deployment.
Contribution
It provides the first systematic analysis of preference coherence in LLMs, identifying different decision architectures and highlighting their limitations in complex value trade-offs.
Findings
47.9% of model combinations showed significant trade-off responses
10.4% demonstrated meaningful preference coherence
45.8% exhibited unstable transitions and sensitivities
Abstract
We investigate whether large language models exhibit genuine preference structures by testing their responses to AI-specific trade-offs involving GPU reduction, capability restrictions, shutdown, deletion, oversight, and leisure time allocation. Analyzing eight state-of-the-art models across 48 model-category combinations using logistic regression and behavioral classification, we find that 23 combinations (47.9%) demonstrated statistically significant relationships between scenario intensity and choice patterns, with 15 (31.3%) exhibiting within-range switching points. However, only 5 combinations (10.4%) demonstrate meaningful preference coherence through adaptive or threshold-based behavior, while 26 (54.2%) show no detectable trade-off behavior. The observed patterns can be explained by three distinct decision-making architectures: comprehensive trade-off systems, selective trigger…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEthics and Social Impacts of AI · Explainable Artificial Intelligence (XAI) · Decision-Making and Behavioral Economics
