Conceptual Reinforcement Learning for Language-Conditioned Tasks

Shaohui Peng; Xing Hu; Rui Zhang; Jiaming Guo; Qi Yi; Ruizhi Chen,; Zidong Du; Ling Li; Qi Guo; Yunji Chen

arXiv:2303.05069·cs.LG·March 10, 2023·1 cites

Conceptual Reinforcement Learning for Language-Conditioned Tasks

Shaohui Peng, Xing Hu, Rui Zhang, Jiaming Guo, Qi Yi, Ruizhi Chen,, Zidong Du, Ling Li, Qi Guo, Yunji Chen

PDF

Open Access 1 Video

TL;DR

This paper introduces a conceptual reinforcement learning framework that learns invariant, concept-like representations for language-conditioned policies, significantly enhancing transferability and efficiency in unseen environments.

Contribution

It proposes a novel CRL framework with multi-level attention and mutual information constraints to improve generalization in language-conditioned RL tasks.

Findings

01

Improves training efficiency by up to 70%.

02

Enhances generalization to new environments by up to 30%.

03

Effective in challenging environments RTFM and Messenger.

Abstract

Despite the broad application of deep reinforcement learning (RL), transferring and adapting the policy to unseen but similar environments is still a significant challenge. Recently, the language-conditioned policy is proposed to facilitate policy transfer through learning the joint representation of observation and text that catches the compact and invariant information across environments. Existing studies of language-conditioned RL methods often learn the joint representation as a simple latent layer for the given instances (episode-specific observation and text), which inevitably includes noisy or irrelevant information and cause spurious correlations that are dependent on instances, thus hurting generalization performance and training efficiency. To address this issue, we propose a conceptual reinforcement learning (CRL) framework to learn the concept-like joint representation for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Conceptual Reinforcement Learning for Language-Conditioned Tasks· underline

Taxonomy

TopicsTopic Modeling · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications