Optimal Design for Human Preference Elicitation

Subhojyoti Mukherjee; Anusha Lalitha; Kousha Kalantari; Aniket Deshmukh; Ge Liu; Yifei Ma; and Branislav Kveton

arXiv:2404.13895·cs.LG·February 17, 2026

Optimal Design for Human Preference Elicitation

Subhojyoti Mukherjee, Anusha Lalitha, Kousha Kalantari, Aniket Deshmukh, Ge Liu, Yifei Ma, and Branislav Kveton

PDF

Open Access 1 Video

TL;DR

This paper develops efficient algorithms for human preference elicitation by generalizing optimal design principles to list-based questions, improving the learning of preference models with fewer high-quality annotations.

Contribution

It introduces a novel approach to optimal design for preference elicitation that handles list-based questions and applies to both absolute and ranking feedback models.

Findings

01

Algorithms are efficient and practical.

02

Effective in reducing annotation costs.

03

Validated on question-answering tasks.

Abstract

Learning of preference models from human feedback has been central to recent advances in artificial intelligence. Motivated by the cost of obtaining high-quality human annotations, we study efficient human preference elicitation for learning preference models. The key idea in our work is to generalize optimal designs, an approach to computing optimal information-gathering policies, to lists of items that represent potential questions with answers. The policy is a distribution over the lists and we elicit preferences from them proportionally to their probabilities. To show the generality of our ideas, we study both absolute and ranking feedback models on items in the list. We design efficient algorithms for both and analyze them. Finally, we demonstrate that our algorithms are practical by evaluating them on existing question-answering problems.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Optimal Design for Human Preference Elicitation· slideslive

Taxonomy

TopicsDesign Education and Practice · Product Development and Customization