Learning Personalized Agents from Human Feedback

Kaiqu Liang; Julia Kruk; Shengyi Qian; Xianjun Yang; Shengjie Bi; Yuanshun Yao; Shaoliang Nie; Mingyang Zhang; Lijuan Liu; Jaime Fern\'andez Fisac; Shuyan Zhou; Saghar Hosseini

arXiv:2602.16173·cs.AI·February 19, 2026

Learning Personalized Agents from Human Feedback

Kaiqu Liang, Julia Kruk, Shengyi Qian, Xianjun Yang, Shengjie Bi, Yuanshun Yao, Shaoliang Nie, Mingyang Zhang, Lijuan Liu, Jaime Fern\'andez Fisac, Shuyan Zhou, Saghar Hosseini

PDF

Open Access

TL;DR

This paper introduces PAHF, a framework enabling AI agents to learn and adapt to individual user preferences over time through online interaction and explicit memory, improving personalization and responsiveness.

Contribution

The paper proposes PAHF, a novel continual learning framework that uses explicit memory and dual feedback channels for personalized agents to adapt to evolving user preferences.

Findings

01

PAHF learns faster than baseline methods.

02

PAHF outperforms no-memory and single-channel baselines.

03

PAHF effectively adapts to preference shifts.

Abstract

Modern AI agents are powerful but often fail to align with the idiosyncratic, evolving preferences of individual users. Prior approaches typically rely on static datasets, either training implicit preference models on interaction history or encoding user profiles in external memory. However, these approaches struggle with new users and with preferences that change over time. We introduce Personalized Agents from Human Feedback (PAHF), a framework for continual personalization in which agents learn online from live interaction using explicit per-user memory. PAHF operationalizes a three-step loop: (1) seeking pre-action clarification to resolve ambiguity, (2) grounding actions in preferences retrieved from memory, and (3) integrating post-action feedback to update memory when preferences drift. To evaluate this capability, we develop a four-phase protocol and two benchmarks in embodied…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSocial Robot Interaction and HRI · Recommender Systems and Techniques · Action Observation and Synchronization