Inductive Biases for Zero-shot Systematic Generalization in Language-informed Reinforcement Learning
Negin Hashemi Dijujin, Seyed Roozbeh Razavi Rohani, Mohammad Mahdi, Samiei, Mahdieh Soleymani Baghshah

TL;DR
This paper introduces architecture-level inductive biases in language-informed reinforcement learning, leveraging neural production systems and memory to enhance systematic generalization and sample efficiency in the BabyAI environment.
Contribution
It proposes a novel architecture with inductive biases, neural production systems, and memory mechanisms to improve generalization and efficiency in language-informed RL.
Findings
Significant improvement in systematic generalization in BabyAI
Enhanced sample efficiency over previous models
Ablation study clarifies the impact of each technique
Abstract
Sample efficiency and systematic generalization are two long-standing challenges in reinforcement learning. Previous studies have shown that involving natural language along with other observation modalities can improve generalization and sample efficiency due to its compositional and open-ended nature. However, to transfer these properties of language to the decision-making process, it is necessary to establish a proper language grounding mechanism. One approach to this problem is applying inductive biases to extract fine-grained and informative representations from the observations, which makes them more connectable to the language units. We provide architecture-level inductive biases for modularity and sparsity mainly based on Neural Production Systems (NPS). Alongside NPS, we assign a central role to memory in our architecture. It can be seen as a high-level information aggregator…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Reinforcement Learning in Robotics · Evolutionary Algorithms and Applications
MethodsSoftmax · Attention Is All You Need
