Behavior Cloned Transformers are Neurosymbolic Reasoners
Ruoyao Wang, Peter Jansen, Marc-Alexandre C\^ot\'e, Prithviraj, Ammanabrolu

TL;DR
This paper demonstrates that augmenting behavior cloned transformers with symbolic module actions significantly improves their multi-step reasoning abilities in text-based games, achieving state-of-the-art performance.
Contribution
It introduces a method to inject symbolic module actions into behavior cloned transformers, enhancing reasoning and generalization in grounded language environments.
Findings
Performance increased by an average of 22% across four benchmarks.
Agents reached highest performance on unseen games.
Action injection is easily extendable to new environments.
Abstract
In this work, we explore techniques for augmenting interactive agents with information from symbolic modules, much like humans use tools like calculators and GPS systems to assist with arithmetic and navigation. We test our agent's abilities in text games -- challenging benchmarks for evaluating the multi-step reasoning abilities of game agents in grounded, language-based environments. Our experimental study indicates that injecting the actions from these symbolic modules into the action space of a behavior cloned transformer agent increases performance on four text game benchmarks that test arithmetic, navigation, sorting, and common sense reasoning by an average of 22%, allowing an agent to reach the highest possible performance on unseen games. This action injection technique is easily extended to new agents, environments, and symbolic modules.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Multimodal Machine Learning Applications · Topic Modeling
MethodsTest · Greedy Policy Search
