Egocentric Planning for Scalable Embodied Task Achievement
Xiaotian Liu, Hector Palacios, Christian Muise

TL;DR
This paper introduces Egocentric Planning, a hybrid symbolic and probabilistic approach enabling embodied agents to generalize and scale in complex domestic tasks, demonstrating high success rates and robustness in simulated environments.
Contribution
The work presents a novel combination of symbolic planning and Object-oriented POMDPs for scalable, robust embodied task execution, leveraging existing perception and language models.
Findings
Achieved 36.07% unseen success rate in ALFRED benchmark.
Won the ALFRED challenge at CVPR Embodied AI workshop.
Demonstrated scalability to new tasks beyond ALFRED.
Abstract
Embodied agents face significant challenges when tasked with performing actions in diverse environments, particularly in generalizing across object types and executing suitable actions to accomplish tasks. Furthermore, agents should exhibit robustness, minimizing the execution of illegal actions. In this work, we present Egocentric Planning, an innovative approach that combines symbolic planning and Object-oriented POMDPs to solve tasks in complex environments, harnessing existing models for visual perception and natural language processing. We evaluated our approach in ALFRED, a simulated environment designed for domestic tasks, and demonstrated its high scalability, achieving an impressive 36.07% unseen success rate in the ALFRED benchmark and winning the ALFRED challenge at CVPR Embodied AI workshop. Our method requires reliable perception and the specification or learning of a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsMultimodal Machine Learning Applications · Human Pose and Action Recognition · Robotic Path Planning Algorithms
