Multi-Task Reward Learning from Human Ratings

Mingkang Wu; Devin White; Evelyn Rose; Vernon Lawhern; Nicholas R Waytowich; Yongcan Cao

arXiv:2506.09183·cs.LG·June 19, 2025

Multi-Task Reward Learning from Human Ratings

Mingkang Wu, Devin White, Evelyn Rose, Vernon Lawhern, Nicholas R Waytowich, Yongcan Cao

PDF

Open Access

TL;DR

This paper introduces a multi-task reinforcement learning method that models human decision-making by integrating classification and regression tasks, improving reward inference from human ratings in RLHF.

Contribution

It proposes a novel RL approach that jointly considers multiple human decision strategies with learnable weights, capturing decision uncertainty and enhancing reward learning.

Findings

01

Outperforms existing rating-based RL methods

02

Surpasses some traditional RL approaches

03

Effectively models human decision-making strategies

Abstract

Reinforcement learning from human feedback (RLHF) has become a key factor in aligning model behavior with users' goals. However, while humans integrate multiple strategies when making decisions, current RLHF approaches often simplify this process by modeling human reasoning through isolated tasks such as classification or regression. In this paper, we propose a novel reinforcement learning (RL) method that mimics human decision-making by jointly considering multiple tasks. Specifically, we leverage human ratings in reward-free environments to infer a reward function, introducing learnable weights that balance the contributions of both classification and regression models. This design captures the inherent uncertainty in human decision-making and allows the model to adaptively emphasize different strategies. We conduct several experiments using synthetic human ratings to validate the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRecommender Systems and Techniques · Emotion and Mood Recognition · Reinforcement Learning in Robotics