TL;DR
LEO-RobotAgent is a versatile, language-driven robotic framework that enables robots to perform complex tasks across various scenarios with improved human-robot interaction and adaptability.
Contribution
It introduces a streamlined, general-purpose framework allowing large models to independently think, plan, and act across multiple robot types and tasks.
Findings
Framework adapts to UAVs, robotic arms, and wheeled robots.
Efficiently executes tasks of varying complexity.
Enhances bidirectional human-robot intent understanding.
Abstract
We propose LEO-RobotAgent, a general-purpose language-driven intelligent agent framework for robots. Under this framework, LLMs can operate different types of robots to complete unpredictable complex tasks across various scenarios. This framework features strong generalization, robustness, and efficiency. The application-level system built around it can fully enhance bidirectional human-robot intent understanding and lower the threshold for human-robot interaction. Regarding robot task planning, the vast majority of existing studies focus on the application of large models in single-task scenarios and for single robot types. These algorithms often have complex structures and lack generalizability. Thus, the proposed LEO-RobotAgent framework is designed with a streamlined structure as much as possible, enabling large models to independently think, plan, and act within this clear…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
