SiT: Symmetry-Invariant Transformers for Generalisation in Reinforcement   Learning

Matthias Weissenbacher; Rishabh Agarwal; Yoshinobu Kawahara

arXiv:2406.15025·cs.LG·June 24, 2024

SiT: Symmetry-Invariant Transformers for Generalisation in Reinforcement Learning

Matthias Weissenbacher, Rishabh Agarwal, Yoshinobu Kawahara

PDF

Open Access 1 Repo

TL;DR

SiT introduces a symmetry-aware vision transformer that enhances generalization and sample efficiency in reinforcement learning by preserving graph symmetries through a novel attention mechanism.

Contribution

The paper proposes Graph Symmetric Attention within a scalable ViT, enabling invariance to symmetries and improving RL generalization and sample efficiency.

Findings

01

Outperforms ViTs on MiniGrid and Procgen benchmarks

02

Achieves better sample efficiency on Atari 100k and CIFAR10

03

Demonstrates improved generalization in RL environments

Abstract

An open challenge in reinforcement learning (RL) is the effective deployment of a trained policy to new or slightly different situations as well as semantically-similar environments. We introduce Symmetry-Invariant Transformer (SiT), a scalable vision transformer (ViT) that leverages both local and global data patterns in a self-supervised manner to improve generalisation. Central to our approach is Graph Symmetric Attention, which refines the traditional self-attention mechanism to preserve graph symmetries, resulting in invariant and equivariant latent representations. We showcase SiT's superior generalization over ViTs on MiniGrid and Procgen RL benchmarks, and its sample efficiency on Atari 100k and CIFAR10.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

matthias-weissenbacher/SiT
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsModular Robots and Swarm Intelligence · Neural Networks and Applications

MethodsAttention Is All You Need · Softmax · Layer Normalization · Byte Pair Encoding · Label Smoothing · Position-Wise Feed-Forward Layer · Dropout · Adam · Linear Layer · Absolute Position Encodings