Demonstration-Free Robotic Control via LLM Agents
Brian Y. Tsui, Alan Y. Fang, Tiffany J. Hwu

TL;DR
This paper demonstrates that large language model (LLM) agents can control robotic manipulation tasks without demonstrations or fine-tuning, achieving high success rates across multiple benchmarks by reasoning through strategies.
Contribution
It introduces FAEA, a novel approach applying unmodified LLM agent frameworks to embodied manipulation, enabling demonstration-free control and planning in robotics.
Findings
FAEA achieves success rates of 84.9%, 85.7%, and 96% on LIBERO, ManiSkill3, and MetaWorld.
Performance approaches VLA models trained with fewer than 100 demonstrations.
Optional human feedback further improves success rate to 88.2% on LIBERO.
Abstract
Robotic manipulation has increasingly adopted vision-language-action (VLA) models, which achieve strong performance but typically require task-specific demonstrations and fine-tuning, and often generalize poorly under domain shift. We investigate whether general-purpose large language model (LLM) agent frameworks, originally developed for software engineering, can serve as an alternative control paradigm for embodied manipulation. We introduce FAEA (Frontier Agent as Embodied Agent), which applies an LLM agent framework directly to embodied manipulation without modification. Using the same iterative reasoning that enables software agents to debug code, FAEA enables embodied agents to reason through manipulation strategies. We evaluate an unmodified frontier agent, Claude Agent SDK, across the LIBERO, ManiSkill3, and MetaWorld benchmarks. With privileged environment state access, FAEA…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Reinforcement Learning in Robotics · Robot Manipulation and Learning
