Compositional Instruction Following with Language Models and   Reinforcement Learning

Vanya Cohen; Geraud Nangue Tasse; Nakul Gopalan; Steven James; Matthew; Gombolay; Ray Mooney; Benjamin Rosman

arXiv:2501.12539·cs.LG·January 23, 2025

Compositional Instruction Following with Language Models and Reinforcement Learning

Vanya Cohen, Geraud Nangue Tasse, Nakul Gopalan, Steven James, Matthew, Gombolay, Ray Mooney, Benjamin Rosman

PDF

Open Access

TL;DR

This paper introduces CERLLA, a reinforcement learning approach that leverages compositional policy representations and semantic parsing to improve language-conditioned task learning efficiency and generalization in complex environments.

Contribution

The paper presents CERLLA, a novel method combining compositional policies and reinforcement learning to enhance sample efficiency and generalization in language-conditioned tasks.

Findings

01

CERLLA significantly outperforms non-compositional baselines in sample complexity.

02

CERLLA achieves a success rate of 92%, matching the oracle policy.

03

The method demonstrates strong compositional generalization to new tasks.

Abstract

Combining reinforcement learning with language grounding is challenging as the agent needs to explore the environment while simultaneously learning multiple language-conditioned tasks. To address this, we introduce a novel method: the compositionally-enabled reinforcement learning language agent (CERLLA). Our method reduces the sample complexity of tasks specified with language by leveraging compositional policy representations and a semantic parser trained using reinforcement learning and in-context learning. We evaluate our approach in an environment requiring function approximation and demonstrate compositional generalization to novel tasks. Our method significantly outperforms the previous best non-compositional baseline in terms of sample complexity on 162 tasks designed to test compositional generalization. Our model attains a higher success rate and learns in fewer steps than the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsInnovative Teaching and Learning Methods