AURA: Autonomous Upskilling with Retrieval-Augmented Agents
Alvin Zhu, Yusuke Tanaka, Andrew Goldberg, Dennis Hong

TL;DR
AURA is a novel framework that uses retrieval-augmented large language models to autonomously design, validate, and refine reinforcement learning curricula for agile robots, enabling scalable and adaptive policy training from user prompts.
Contribution
AURA introduces an automated, schema-validated curriculum RL framework leveraging LLMs and retrieval feedback, reducing manual tuning and enabling zero-shot deployment on robots.
Findings
Outperforms LLM-guided baselines in success rate, locomotion, and manipulation.
Schema validation and retrieval are crucial for curriculum quality.
Successfully trains and deploys policies directly from user prompts.
Abstract
Designing reinforcement learning curricula for agile robots traditionally requires extensive manual tuning of reward functions, environment randomizations, and training configurations. We introduce AURA (Autonomous Upskilling with Retrieval-Augmented Agents), a schema-validated curriculum reinforcement learning (RL) framework that leverages Large Language Models (LLMs) as autonomous designers of multi-stage curricula. AURA transforms user prompts into YAML workflows that encode full reward functions, domain randomization strategies, and training configurations. All files are statically validated before any GPU time is used, ensuring efficient and reliable execution. A retrieval-augmented feedback loop allows specialized LLM agents to design, execute, and refine curriculum stages based on prior training results stored in a vector database, enabling continual improvement over time.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
