Conceptual Reinforcement Learning for Language-Conditioned Tasks
Shaohui Peng, Xing Hu, Rui Zhang, Jiaming Guo, Qi Yi, Ruizhi Chen,, Zidong Du, Ling Li, Qi Guo, Yunji Chen

TL;DR
This paper introduces a conceptual reinforcement learning framework that learns invariant, concept-like representations for language-conditioned policies, significantly enhancing transferability and efficiency in unseen environments.
Contribution
It proposes a novel CRL framework with multi-level attention and mutual information constraints to improve generalization in language-conditioned RL tasks.
Findings
Improves training efficiency by up to 70%.
Enhances generalization to new environments by up to 30%.
Effective in challenging environments RTFM and Messenger.
Abstract
Despite the broad application of deep reinforcement learning (RL), transferring and adapting the policy to unseen but similar environments is still a significant challenge. Recently, the language-conditioned policy is proposed to facilitate policy transfer through learning the joint representation of observation and text that catches the compact and invariant information across environments. Existing studies of language-conditioned RL methods often learn the joint representation as a simple latent layer for the given instances (episode-specific observation and text), which inevitably includes noisy or irrelevant information and cause spurious correlations that are dependent on instances, thus hurting generalization performance and training efficiency. To address this issue, we propose a conceptual reinforcement learning (CRL) framework to learn the concept-like joint representation for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsTopic Modeling · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications
