Logic and the $2$-Simplicial Transformer

James Clift; Dmitry Doryn; Daniel Murfet; James Wallbridge

arXiv:1909.00668·cs.LG·September 15, 2019

Logic and the $2$-Simplicial Transformer

James Clift, Dmitry Doryn, Daniel Murfet, James Wallbridge

PDF

Open Access

TL;DR

The paper presents the 2-simplicial Transformer, an advanced neural architecture that extends attention mechanisms to higher dimensions, enhancing logical reasoning capabilities in deep reinforcement learning tasks.

Contribution

It introduces the 2-simplicial Transformer with higher-dimensional attention and demonstrates its effectiveness for logical reasoning in reinforcement learning.

Findings

01

Improved logical reasoning performance in RL tasks.

02

Higher-dimensional attention enhances entity representation updates.

03

The architecture generalizes traditional dot-product attention.

Abstract

We introduce the $2$ -simplicial Transformer, an extension of the Transformer which includes a form of higher-dimensional attention generalising the dot-product attention, and uses this attention to update entity representations with tensor products of value vectors. We show that this architecture is a useful inductive bias for logical reasoning in the context of deep reinforcement learning.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Multimodal Machine Learning Applications · Advanced Graph Neural Networks

MethodsLinear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Residual Connection · Byte Pair Encoding · Dense Connections · Label Smoothing · *Communicated@Fast*How Do I Communicate to Expedia? · Adam · Softmax