$\mathcal{P}^3$: Toward Versatile Embodied Agents
Shengli Zhou, Xiangchen Wang, Jinrui Zhang, Ruozai Tian, Rongtao Xu, Feng Zheng

TL;DR
The paper introduces $ ext{P}^3$, a versatile embodied agent framework that actively perceives environments, flexibly uses tools without feedback, and dynamically plans multi-task execution, enhancing real-world adaptability.
Contribution
The paper presents $ ext{P}^3$, a novel unified framework integrating real-time perception and dynamic scheduling for embodied agents, addressing key challenges in versatility and multi-task management.
Findings
Achieves high transferability in real-world environments.
Enables tool use without feedback, increasing flexibility.
Improves multi-task planning with dynamic prioritization.
Abstract
Embodied agents have shown promising generalization capabilities across diverse physical environments, making them essential for a wide range of real-world applications. However, building versatile embodied agents poses critical challenges due to three key issues: dynamic environment perception, open-ended tool usage, and complex multi-task planning. Most previous works rely solely on feedback from tool agents to perceive environmental changes and task status, which limits adaptability to real-time dynamics, causes error accumulation, and restricts tool flexibility. Furthermore, multi-task scheduling has received limited attention, primarily due to the inherent complexity of managing task dependencies and balancing competing priorities in dynamic and complex environments. To overcome these challenges, we introduce , a unified framework that integrates real-time perception…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
