Beyond Ordinal Preferences: Why Alignment Needs Cardinal Human Feedback

Parker Whitfill; Stewy Slocum

arXiv:2508.08486 · cs.AI · August 13, 2025

TL;DR

This paper argues that ordinal human preferences alone are fundamentally insufficient for aligning language models, and reports that collecting cardinal feedback improves model alignment and performance.

Contribution

The paper proves that no algorithm relying solely on ordinal comparisons can systematically recover the most preferred model, and introduces a new dataset of cardinal judgments for fine-tuning.

Findings

01. Cardinal feedback enables better tradeoff resolution in model alignment.

02. Models fine-tuned with cardinal data outperform ordinal-only methods on benchmarks.

03. The authors collected 25,000 cardinal judgments using willingness-to-pay elicitation.

Abstract

Alignment techniques for LLMs rely on optimizing preference-based objectives -- where these preferences are typically elicited as ordinal, binary choices between responses. Recent work has focused on improving label quality or mitigating particular biases, but we identify a more fundamental limitation: these methods collect the wrong kind of data. We prove an impossibility result: no algorithm relying solely on ordinal comparisons can systematically recover the most preferred model. Intuitively, ordinal data lacks the information needed to resolve tradeoffs -- e.g., fixing a factual error on one prompt versus improving style on another. We show that selecting the optimal model requires recovering preferences over models (rather than just responses), which can only be identified given cardinal feedback about response quality. To address this, we collect and publicly release a…
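
The tradeoff intuition in the abstract can be illustrated with a toy example. The sketch below is not the paper's construction: the two models `A` and `B`, the two prompts, the utility numbers, and the choice to score a model by its summed utility across prompts are all assumptions made for illustration. It shows two hypothetical annotators whose binary comparisons are identical on every prompt, yet whose cardinal utilities favor different models overall.

```python
from itertools import combinations


def ordinal_labels(utils):
    """Return the per-prompt pairwise winner implied by cardinal utilities."""
    labels = {}
    for prompt, scores in utils.items():
        for a, b in combinations(sorted(scores), 2):
            labels[(prompt, a, b)] = a if scores[a] > scores[b] else b
    return labels


def preferred_model(utils):
    """Pick the model with the highest summed utility (an assumed aggregation)."""
    models = sorted(next(iter(utils.values())))
    totals = {m: sum(scores[m] for scores in utils.values()) for m in models}
    return max(totals, key=totals.get)


# Hypothetical utilities: model A fixes a factual error, model B has better style.
annotator_1 = {
    "factual_prompt": {"A": 10.0, "B": 0.0},  # cares a lot about the factual fix
    "style_prompt": {"A": 0.0, "B": 1.0},
}
annotator_2 = {
    "factual_prompt": {"A": 1.0, "B": 0.0},
    "style_prompt": {"A": 0.0, "B": 10.0},  # cares a lot about the style gain
}

same_ordinal_data = ordinal_labels(annotator_1) == ordinal_labels(annotator_2)
best_1 = preferred_model(annotator_1)
best_2 = preferred_model(annotator_2)
print(same_ordinal_data, best_1, best_2)
```

Under these assumed numbers, any method that sees only the binary labels receives the same input from both annotators, so it cannot return `A` for one and `B` for the other. Telling the two cases apart requires the magnitudes, which is the kind of information cardinal feedback supplies.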

Code & Models

Datasets

cardinal-prefs/CardinalPrefs
dataset · 3 downloads

Taxonomy

Topics: Explainable Artificial Intelligence (XAI) · Bayesian Modeling and Causal Inference · Ethics and Social Impacts of AI