Infusing Lattice Symmetry Priors in Attention Mechanisms for   Sample-Efficient Abstract Geometric Reasoning

Mattia Atzeni; Mrinmaya Sachan; Andreas Loukas

arXiv:2306.03175·cs.AI·June 7, 2023·1 cites

Infusing Lattice Symmetry Priors in Attention Mechanisms for Sample-Efficient Abstract Geometric Reasoning

Mattia Atzeni, Mrinmaya Sachan, Andreas Loukas

PDF

Open Access 1 Video

TL;DR

This paper introduces LatFormer, a model that embeds lattice symmetry priors into attention mechanisms, significantly improving sample efficiency and geometric reasoning capabilities in AI tasks like ARC and LARC.

Contribution

The paper proposes a novel attention modification that incorporates lattice symmetry priors, enabling models to better generalize in geometric reasoning tasks.

Findings

01

LatFormer requires 100 times less data than standard transformers.

02

Incorporating geometric priors improves performance on synthetic geometric reasoning.

03

Preliminary results suggest deep learning models can handle complex geometric datasets.

Abstract

The Abstraction and Reasoning Corpus (ARC) (Chollet, 2019) and its most recent language-complete instantiation (LARC) has been postulated as an important step towards general AI. Yet, even state-of-the-art machine learning models struggle to achieve meaningful performance on these problems, falling behind non-learning based approaches. We argue that solving these tasks requires extreme generalization that can only be achieved by proper accounting for core knowledge priors. As a step towards this goal, we focus on geometry priors and introduce LatFormer, a model that incorporates lattice symmetry priors in attention masks. We show that, for any transformation of the hypercubic lattice, there exists a binary attention mask that implements that group action. Hence, our study motivates a modification to the standard attention mechanism, where attention weights are scaled using soft masks…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Infusing Lattice Symmetry Priors in Attention Mechanisms for Sample-Efficient Abstract Geometric Reasoning· slideslive

Taxonomy

TopicsMachine Learning in Materials Science · Topic Modeling

MethodsFocus