Enhancing Reinforcement Learning with discrete interfaces to learn the   Dyck Language

Florian Dietz; Dietrich Klakow

arXiv:2110.14350·cs.LG·October 28, 2021

Enhancing Reinforcement Learning with discrete interfaces to learn the Dyck Language

Florian Dietz, Dietrich Klakow

PDF

Open Access

TL;DR

This paper introduces a novel reinforcement learning approach that incorporates discrete interfaces to enable neural networks to understand and generate hierarchical structures like the Dyck language, achieving significant generalization and efficiency.

Contribution

It presents the first neural network solution that learns to utilize discrete data structures for hierarchical language understanding in reinforcement learning.

Findings

01

Model generalizes to sequences ten times longer than training data

02

Pre-training on execution traces improves training stability

03

Resulting model is small and fast

Abstract

Even though most interfaces in the real world are discrete, no efficient way exists to train neural networks to make use of them, yet. We enhance an Interaction Network (a Reinforcement Learning architecture) with discrete interfaces and train it on the generalized Dyck language. This task requires an understanding of hierarchical structures to solve, and has long proven difficult for neural networks. We provide the first solution based on learning to use discrete data structures. We encountered unexpected anomalous behavior during training, and utilized pre-training based on execution traces to overcome them. The resulting model is very small and fast, and generalizes to sequences that are an entire order of magnitude longer than the training data.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFerroelectric and Negative Capacitance Devices · Neural Networks and Applications · Neural Networks and Reservoir Computing