Integrating LLMs and Decision Transformers for Language Grounded Generative Quality-Diversity
Achkan Salehi, Stephane Doncieux

TL;DR
This paper introduces a method combining Large Language Models and Decision Transformers to generate diverse, customizable trajectories in reinforcement learning environments, enabling natural language control and evaluation.
Contribution
It proposes a novel approach that integrates LLMs with Decision Transformers for language-grounded policy generation and introduces a new benchmark for evaluation.
Findings
Enables natural language specification of behaviors.
Allows customizable trajectory generation beyond discrete descriptors.
Provides an LLM-based evaluation method for generative agents.
Abstract
Quality-Diversity is a branch of stochastic optimization that is often applied to problems from the Reinforcement Learning and control domains in order to construct repertoires of well-performing policies/skills that exhibit diversity with respect to a behavior space. Such archives are usually composed of a finite number of reactive agents, each associated with a unique behavior descriptor, and instantiating behavior descriptors outside of that coarsely discretized space is not straightforward. While a few recent works suggest solutions to that issue, the trajectory that is generated is not easily customizable beyond the specification of a target behavior descriptor. We propose to jointly solve those problems in environments where semantic information about static scene elements is available by leveraging a Large Language Model to augment the repertoire with natural language…
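The coarsely discretized archive the abstract refers to can be illustrated with a minimal sketch. This is a generic MAP-Elites-style grid written for illustration, not the authors' implementation: the `GridArchive` class, its method names, and the two-dimensional descriptor bounds are all assumptions. It shows why a target descriptor that falls between stored entries can only be served by the nearest existing elite.

```python
class GridArchive:
    """Toy grid archive (hypothetical): at most one elite per cell of a
    uniformly discretized behavior space."""

    def __init__(self, bins, low, high):
        self.bins = bins    # number of bins per descriptor dimension
        self.low = low      # lower bound of each descriptor dimension
        self.high = high    # upper bound of each descriptor dimension
        self.cells = {}     # cell index (tuple) -> (fitness, policy, descriptor)

    def cell_of(self, descriptor):
        """Map a continuous behavior descriptor to its grid cell."""
        index = []
        for d, lo, hi in zip(descriptor, self.low, self.high):
            frac = (d - lo) / (hi - lo)
            # Clamp so that out-of-range or boundary values stay inside the grid.
            index.append(min(self.bins - 1, max(0, int(frac * self.bins))))
        return tuple(index)

    def add(self, policy, descriptor, fitness):
        """Keep the policy only if its cell is empty or it beats the current elite."""
        key = self.cell_of(descriptor)
        current = self.cells.get(key)
        if current is None or fitness > current[0]:
            self.cells[key] = (fitness, policy, descriptor)
            return True
        return False

    def nearest(self, target):
        """Return the stored elite whose descriptor is closest to the target."""
        if not self.cells:
            raise ValueError("archive is empty")

        def squared_distance(entry):
            return sum((a - b) ** 2 for a, b in zip(entry[2], target))

        return min(self.cells.values(), key=squared_distance)
```

With two bins per dimension on the unit square, for example, the archive holds at most four policies, so a request for the descriptor (0.6, 0.4) returns whichever stored elite happens to be closest rather than a policy that actually realizes that descriptor.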
Taxonomy
Topics: Reinforcement Learning in Robotics · Topic Modeling · Machine Learning and Data Classification
