Aligning Language Model Benchmarks with Pairwise Preferences

Marco Gutierrez; Xinyi Leng; Hannah Cyberey; Jonathan Richard Schwarz; Ahmed Alaa; Thomas Hartvigsen

arXiv:2602.02898·cs.AI·February 4, 2026

Aligning Language Model Benchmarks with Pairwise Preferences

Marco Gutierrez, Xinyi Leng, Hannah Cyberey, Jonathan Richard Schwarz, Ahmed Alaa, Thomas Hartvigsen

PDF

Open Access

TL;DR

This paper introduces BenchAlign, a method to automatically update language model benchmarks to better predict pairwise model preferences, aligning them with human-like utility and preferences.

Contribution

We propose BenchAlign, the first approach to learn preference-aligned benchmark question weightings using model performance and pairwise preferences, improving ranking accuracy.

Findings

01

Aligned benchmarks accurately rank unseen models by human preferences

02

BenchAlign remains interpretable across different model sizes

03

Our method bridges the gap between benchmarks and real-world utility

Abstract

Language model benchmarks are pervasive and computationally-efficient proxies for real-world performance. However, many recent works find that benchmarks often fail to predict real utility. Towards bridging this gap, we introduce benchmark alignment, where we use limited amounts of information about model performance to automatically update offline benchmarks, aiming to produce new static benchmarks that predict model pairwise preferences in given test settings. We then propose BenchAlign, the first solution to this problem, which learns preference-aligned weight- ings for benchmark questions using the question-level performance of language models alongside ranked pairs of models that could be collected during deployment, producing new benchmarks that rank previously unseen models according to these preferences. Our experiments show that our aligned benchmarks can accurately rank unseen…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Explainable Artificial Intelligence (XAI) · Multimodal Machine Learning Applications