Logic and the $2$-Simplicial Transformer
James Clift, Dmitry Doryn, Daniel Murfet, James Wallbridge

TL;DR
The paper presents the 2-simplicial Transformer, an advanced neural architecture that extends attention mechanisms to higher dimensions, enhancing logical reasoning capabilities in deep reinforcement learning tasks.
Contribution
It introduces the 2-simplicial Transformer with higher-dimensional attention and demonstrates its effectiveness for logical reasoning in reinforcement learning.
Findings
Improved logical reasoning performance in RL tasks.
Higher-dimensional attention enhances entity representation updates.
The architecture generalizes traditional dot-product attention.
Abstract
We introduce the -simplicial Transformer, an extension of the Transformer which includes a form of higher-dimensional attention generalising the dot-product attention, and uses this attention to update entity representations with tensor products of value vectors. We show that this architecture is a useful inductive bias for logical reasoning in the context of deep reinforcement learning.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Multimodal Machine Learning Applications · Advanced Graph Neural Networks
MethodsLinear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Residual Connection · Byte Pair Encoding · Dense Connections · Label Smoothing · *Communicated@Fast*How Do I Communicate to Expedia? · Adam · Softmax
