Translation Equivariant Transformer Neural Processes

Matthew Ashman; Cristiana Diaconu; Junhyuck Kim; Lakee Sivaraya,; Stratis Markou; James Requeima; Wessel P. Bruinsma; Richard E. Turner

arXiv:2406.12409·stat.ML·June 19, 2024

Translation Equivariant Transformer Neural Processes

Matthew Ashman, Cristiana Diaconu, Junhyuck Kim, Lakee Sivaraya,, Stratis Markou, James Requeima, Wessel P. Bruinsma, Richard E. Turner

PDF

Open Access 1 Repo

TL;DR

This paper introduces translation equivariant Transformer Neural Processes (TE-TNPs), enhancing neural process models by incorporating translation symmetry, leading to improved performance on spatio-temporal data.

Contribution

The paper proposes a novel family of translation equivariant TNPs that explicitly incorporate translation symmetry into the model architecture.

Findings

01

TE-TNPs outperform non-equivariant models on synthetic data

02

TE-TNPs show improved accuracy on real-world spatio-temporal datasets

03

Translation equivariance enhances model robustness and generalization

Abstract

The effectiveness of neural processes (NPs) in modelling posterior prediction maps -- the mapping from data to posterior predictive distributions -- has significantly improved since their inception. This improvement can be attributed to two principal factors: (1) advancements in the architecture of permutation invariant set functions, which are intrinsic to all NPs; and (2) leveraging symmetries present in the true posterior predictive map, which are problem dependent. Transformers are a notable development in permutation invariant set functions, and their utility within NPs has been demonstrated through the family of models we refer to as TNPs. Despite significant interest in TNPs, little attention has been given to incorporating symmetries. Notably, the posterior prediction maps for data that are stationary -- a common assumption in spatio-temporal modelling -- exhibit translation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

cambridge-mlg/tetnp
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications

MethodsSoftmax · Attention Is All You Need · Sparse Evolutionary Training