Game On: Towards Language Models as RL Experimenters

Jingwei Zhang; Thomas Lampe; Abbas Abdolmaleki; Jost Tobias; Springenberg; Martin Riedmiller

arXiv:2409.03402·cs.AI·September 6, 2024

Game On: Towards Language Models as RL Experimenters

Jingwei Zhang, Thomas Lampe, Abbas Abdolmaleki, Jost Tobias, Springenberg, Martin Riedmiller

PDF

Open Access

TL;DR

This paper introduces an innovative agent architecture that automates reinforcement learning experiments using a vision-language model, enabling autonomous curriculum creation and skill acquisition for embodied agents, demonstrated through a robotics domain.

Contribution

The work presents one of the first systems leveraging a vision-language model throughout the entire RL experiment cycle for autonomous mastery of control tasks.

Findings

01

System can generate curricula for skill learning

02

Data collected improves control policy training

03

Promising results in building skill libraries and progress assessment

Abstract

We propose an agent architecture that automates parts of the common reinforcement learning experiment workflow, to enable automated mastery of control domains for embodied agents. To do so, it leverages a VLM to perform some of the capabilities normally required of a human experimenter, including the monitoring and analysis of experiment progress, the proposition of new tasks based on past successes and failures of the agent, decomposing tasks into a sequence of subtasks (skills), and retrieval of the skill to execute - enabling our system to build automated curricula for learning. We believe this is one of the first proposals for a system that leverages a VLM throughout the full experiment cycle of reinforcement learning. We provide a first prototype of this system, and examine the feasibility of current models and techniques for the desired level of automation. For this, we use a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMulti-Agent Systems and Negotiation · Natural Language Processing Techniques

MethodsLib