Learning to Plan with Personalized Preferences

Manjie Xu; Xinyi Yang; Wei Liang; Chi Zhang; Yixin Zhu

arXiv:2502.00858·cs.AI·September 15, 2025

Learning to Plan with Personalized Preferences

Manjie Xu, Xinyi Yang, Wei Liang, Chi Zhang, Yixin Zhu

PDF

Open Access

TL;DR

This paper introduces a new benchmark and methods for AI agents to learn and adapt to individual human preferences in planning tasks, improving personalized plan generation.

Contribution

It develops a framework for agents to learn preferences from few demonstrations and adapt their planning strategies accordingly, introducing the Preference-based Planning benchmark.

Findings

01

Symbol-based approaches show promise in scalability.

02

Significant challenges remain in learning personalized plans.

03

Incorporating preferences improves plan personalization.

Abstract

Effective integration of AI agents into daily life requires them to understand and adapt to individual human preferences, particularly in collaborative roles. Although recent studies on embodied intelligence have advanced significantly, they typically adopt generalized approaches that overlook personal preferences in planning. We address this limitation by developing agents that not only learn preferences from few demonstrations but also learn to adapt their planning strategies based on these preferences. Our research leverages the observation that preferences, though implicitly expressed through minimal demonstrations, can generalize across diverse planning scenarios. To systematically evaluate this hypothesis, we introduce Preference-based Planning (PbP) benchmark, an embodied benchmark featuring hundreds of diverse preferences spanning from atomic actions to complex sequences. Our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAI-based Problem Solving and Planning

MethodsADaptive gradient method with the OPTimal convergence rate