The Expertise Problem: Learning from Specialized Feedback
Oliver Daniels-Koch, Rachel Freedman

TL;DR
This paper formalizes the expertise problem in reinforcement learning from human feedback, highlighting how varying teacher expertise affects feedback reliability, and provides an open-source benchmark for future research.
Contribution
It introduces the expertise problem in RLHF, extends an existing benchmark to evaluate it, and offers techniques for improved teacher and query selection.
Findings
Demonstrates the impact of teacher expertise variability on RLHF performance.
Provides an open-source benchmark for the expertise problem.
Evaluates state-of-the-art RLHF algorithms under expertise variability.
Abstract
Reinforcement learning from human feedback (RLHF) is a powerful technique for training agents to perform difficult-to-specify tasks. However, human feedback can be noisy, particularly when human teachers lack relevant knowledge or experience. Levels of expertise vary across teachers, and a given teacher may have differing levels of expertise for different components of a task. RLHF algorithms that learn from multiple teachers therefore face an expertise problem: the reliability of a given piece of feedback depends both on the teacher that it comes from and how specialized that teacher is on relevant components of the task. Existing state-of-the-art RLHF algorithms assume that all evaluations come from the same distribution, obscuring this inter- and intra-human variance, and preventing them from accounting for or taking advantage of variations in expertise. We formalize this problem,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Domain Adaptation and Few-Shot Learning · Machine Learning and Algorithms
