ZEBRA: Leveraging Model-Behavioral Knowledge for Zero-Annotation Preference Dataset Construction

Jeesu Jung; Chanjun Park; Sangkeun Jung

arXiv:2502.18744·cs.AI·June 3, 2025

ZEBRA: Leveraging Model-Behavioral Knowledge for Zero-Annotation Preference Dataset Construction

Jeesu Jung, Chanjun Park, Sangkeun Jung

PDF

Open Access 1 Video

TL;DR

ZEBRA introduces a zero-annotation framework that constructs preference datasets for language model alignment by analyzing model behavior, eliminating the need for manual labeling and reducing costs while maintaining high performance.

Contribution

ZEBRA is the first to leverage model behavior knowledge for preference dataset construction without instance-level annotations, enabling scalable and cost-effective alignment data generation.

Findings

01

Achieves alignment performance comparable to supervised methods

02

Eliminates manual annotation costs

03

Demonstrates scalable preference data construction

Abstract

Recent efforts in LLM alignment have focused on constructing large-scale preference datasets via human or Artificial Intelligence (AI) annotators. However, such approaches rely on instance-wise supervision, incurring substantial annotation cost and limited interpretability. In this paper, we propose ZEBRA - a model behavior-wise zero-annotation framework that constructs preference data by leveraging model behavior knowledge derived from benchmark performances. ZEBRA binarizes response pairs by evaluating the quality and similarity of their origin models, entirely bypassing instance-level annotation. This allows scalable, controllable, and cost-effective alignment data generation. Empirical results show that ZEBRA achieves alignment performance comparable to instance-supervised methods, despite requiring no manual or model-based labeling.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

ZEBRA: Leveraging Model-Behavioral Knowledge for Zero-Annotation Preference Dataset Construction· underline

Taxonomy

TopicsSemantic Web and Ontologies · Bayesian Modeling and Causal Inference · AI-based Problem Solving and Planning