LgTS: Dynamic Task Sampling using LLM-generated sub-goals for   Reinforcement Learning Agents

Yash Shukla; Wenchang Gao; Vasanth Sarathy; Alvaro Velasquez; Robert; Wright; Jivko Sinapov

arXiv:2310.09454·cs.AI·October 17, 2023·2 cites

LgTS: Dynamic Task Sampling using LLM-generated sub-goals for Reinforcement Learning Agents

Yash Shukla, Wenchang Gao, Vasanth Sarathy, Alvaro Velasquez, Robert, Wright, Jivko Sinapov

PDF

Open Access

TL;DR

This paper introduces LgTS, a novel method that leverages LLM-generated sub-goals to guide reinforcement learning agents in environments with unknown dynamics, reducing interactions and improving policy learning without requiring fine-tuned models.

Contribution

The work presents a new approach that uses LLMs for sub-goal generation in RL without fine-tuning or pre-trained policies, and employs Teacher-Student learning to minimize environment interactions.

Findings

01

Graphical sub-goal structures aid policy learning.

02

LgTS reduces environment interactions in unknown dynamics.

03

Effective in gridworld and search-and-rescue domains.

Abstract

Recent advancements in reasoning abilities of Large Language Models (LLM) has promoted their usage in problems that require high-level planning for robots and artificial agents. However, current techniques that utilize LLMs for such planning tasks make certain key assumptions such as, access to datasets that permit finetuning, meticulously engineered prompts that only provide relevant and essential information to the LLM, and most importantly, a deterministic approach to allow execution of the LLM responses either in the form of existing policies or plan operators. In this work, we propose LgTS (LLM-guided Teacher-Student learning), a novel approach that explores the planning abilities of LLMs to provide a graphical representation of the sub-goals to a reinforcement learning (RL) agent that does not have access to the transition dynamics of the environment. The RL agent uses…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications