Inductive Biases for Zero-shot Systematic Generalization in   Language-informed Reinforcement Learning

Negin Hashemi Dijujin; Seyed Roozbeh Razavi Rohani; Mohammad Mahdi; Samiei; Mahdieh Soleymani Baghshah

arXiv:2501.15270·cs.LG·January 28, 2025

Inductive Biases for Zero-shot Systematic Generalization in Language-informed Reinforcement Learning

Negin Hashemi Dijujin, Seyed Roozbeh Razavi Rohani, Mohammad Mahdi, Samiei, Mahdieh Soleymani Baghshah

PDF

Open Access 1 Repo

TL;DR

This paper introduces architecture-level inductive biases in language-informed reinforcement learning, leveraging neural production systems and memory to enhance systematic generalization and sample efficiency in the BabyAI environment.

Contribution

It proposes a novel architecture with inductive biases, neural production systems, and memory mechanisms to improve generalization and efficiency in language-informed RL.

Findings

01

Significant improvement in systematic generalization in BabyAI

02

Enhanced sample efficiency over previous models

03

Ablation study clarifies the impact of each technique

Abstract

Sample efficiency and systematic generalization are two long-standing challenges in reinforcement learning. Previous studies have shown that involving natural language along with other observation modalities can improve generalization and sample efficiency due to its compositional and open-ended nature. However, to transfer these properties of language to the decision-making process, it is necessary to establish a proper language grounding mechanism. One approach to this problem is applying inductive biases to extract fine-grained and informative representations from the observations, which makes them more connectable to the language units. We provide architecture-level inductive biases for modularity and sparsity mainly based on Neural Production Systems (NPS). Alongside NPS, we assign a central role to memory in our architecture. It can be seen as a high-level information aggregator…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

nhashemi202/ICMO
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Reinforcement Learning in Robotics · Evolutionary Algorithms and Applications

MethodsSoftmax · Attention Is All You Need