AsymmetryZero: A Framework for Operationalizing Human Expert Preferences as Semantic Evals
Tadhg Looram, Lucas Nuzzi, Kyle Waters, Steven Dillmann

TL;DR
AsymmetryZero is a framework that operationalizes human expert preferences as semantic evaluations, enabling consistent, cost-effective, and comparable model assessments across different evaluation settings.
Contribution
It introduces a task evaluation contract system that explicitly encodes expert criteria, facilitating model-only and agentic evaluations with shared audit artifacts.
Findings
Criterion-level agreement between juries ranges from 75.9% to 89.6%.
Compact juries show higher internal dissent than frontier juries.
Compact juries significantly reduce judging cost and latency while maintaining stable task outcomes.
Abstract
Much of the focus in RL today is on evaluation design: building meaningful evals that serve simultaneously as benchmarks and as well-defined reward signals for post-training. Yet, many real-world tasks are governed by subjective, procedural, and domain-specific requirements that are difficult to encode as exact-match targets or open-ended preference judgments frequently used in RL pipelines today. In this work, we present AsymmetryZero, a framework for operationalizing human expert preferences as semantic evals. AsymmetryZero represents each task as a stable evaluation contract that makes grading criteria explicit: what is being graded, how each criterion is judged, and how criterion-level decisions are aggregated into a task outcome. The same contract can be executed using Inspect for model-only evaluations, as well as the Harbor Framework for agentic evaluations, enabling comparable…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
