AURA: Autonomous Upskilling with Retrieval-Augmented Agents

Alvin Zhu; Yusuke Tanaka; Andrew Goldberg; Dennis Hong

arXiv:2506.02507·cs.RO·November 6, 2025

AURA: Autonomous Upskilling with Retrieval-Augmented Agents

Alvin Zhu, Yusuke Tanaka, Andrew Goldberg, Dennis Hong

PDF

TL;DR

AURA is a novel framework that uses retrieval-augmented large language models to autonomously design, validate, and refine reinforcement learning curricula for agile robots, enabling scalable and adaptive policy training from user prompts.

Contribution

AURA introduces an automated, schema-validated curriculum RL framework leveraging LLMs and retrieval feedback, reducing manual tuning and enabling zero-shot deployment on robots.

Findings

01

Outperforms LLM-guided baselines in success rate, locomotion, and manipulation.

02

Schema validation and retrieval are crucial for curriculum quality.

03

Successfully trains and deploys policies directly from user prompts.

Abstract

Designing reinforcement learning curricula for agile robots traditionally requires extensive manual tuning of reward functions, environment randomizations, and training configurations. We introduce AURA (Autonomous Upskilling with Retrieval-Augmented Agents), a schema-validated curriculum reinforcement learning (RL) framework that leverages Large Language Models (LLMs) as autonomous designers of multi-stage curricula. AURA transforms user prompts into YAML workflows that encode full reward functions, domain randomization strategies, and training configurations. All files are statically validated before any GPU time is used, ensuring efficient and reliable execution. A retrieval-augmented feedback loop allows specialized LLM agents to design, execute, and refine curriculum stages based on prior training results stored in a vector database, enabling continual improvement over time.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.