Reinforcement Learning from User Feedback

Eric Han; Jun Chen; Karthik Abinav Sankararaman; Xiaoliang Peng; Tengyu Xu; Eryk Helenowski; Kaiyan Peng; Mrinal Kumar; Sinong Wang; Han Fang; Arya Talebzadeh

arXiv:2505.14946·cs.AI·May 22, 2025

Reinforcement Learning from User Feedback

Eric Han, Jun Chen, Karthik Abinav Sankararaman, Xiaoliang Peng, Tengyu Xu, Eryk Helenowski, Kaiyan Peng, Mrinal Kumar, Sinong Wang, Han Fang, Arya Talebzadeh

PDF

Open Access

TL;DR

This paper introduces RLUF, a framework that aligns large language models with real user preferences by using implicit feedback signals like emoji reactions, improving positive feedback rates in deployment.

Contribution

RLUF is a novel framework that directly leverages implicit user signals for LLM alignment, addressing limitations of expert-based feedback methods.

Findings

01

P[Love] predicts positive user reactions effectively.

02

RLUF increases positive feedback rates by 28% in live tests.

03

Reward hacking challenges require careful balancing of objectives.

Abstract

As large language models (LLMs) are increasingly deployed in diverse user facing applications, aligning them with real user preferences becomes essential. Existing methods like Reinforcement Learning from Human Feedback (RLHF) rely on expert annotators trained on manually defined guidelines, whose judgments may not reflect the priorities of everyday users. We introduce Reinforcement Learning from User Feedback (RLUF), a framework for aligning LLMs directly to implicit signals from users in production. RLUF addresses key challenges of user feedback: user feedback is often binary (e.g., emoji reactions), sparse, and occasionally adversarial. We train a reward model, P[Love], to predict the likelihood that an LLM response will receive a Love Reaction, a lightweight form of positive user feedback, and integrate P[Love] into a multi-objective policy optimization framework alongside…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsData Stream Mining Techniques · Reinforcement Learning in Robotics · Innovation Diffusion and Forecasting