Learning to Plan with Personalized Preferences
Manjie Xu, Xinyi Yang, Wei Liang, Chi Zhang, Yixin Zhu

TL;DR
This paper introduces a new benchmark and methods for AI agents to learn and adapt to individual human preferences in planning tasks, improving personalized plan generation.
Contribution
It develops a framework for agents to learn preferences from few demonstrations and adapt their planning strategies accordingly, introducing the Preference-based Planning benchmark.
Findings
Symbol-based approaches show promise in scalability.
Significant challenges remain in learning personalized plans.
Incorporating preferences improves plan personalization.
Abstract
Effective integration of AI agents into daily life requires them to understand and adapt to individual human preferences, particularly in collaborative roles. Although recent studies on embodied intelligence have advanced significantly, they typically adopt generalized approaches that overlook personal preferences in planning. We address this limitation by developing agents that not only learn preferences from few demonstrations but also learn to adapt their planning strategies based on these preferences. Our research leverages the observation that preferences, though implicitly expressed through minimal demonstrations, can generalize across diverse planning scenarios. To systematically evaluate this hypothesis, we introduce Preference-based Planning (PbP) benchmark, an embodied benchmark featuring hundreds of diverse preferences spanning from atomic actions to complex sequences. Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI-based Problem Solving and Planning
MethodsADaptive gradient method with the OPTimal convergence rate
