Self-Attention in Colors: Another Take on Encoding Graph Structure in   Transformers

Romain Menegaux; Emmanuel Jehanno; Margot Selosse; Julien; Mairal

arXiv:2304.10933·cs.LG·April 24, 2023·1 cites

Self-Attention in Colors: Another Take on Encoding Graph Structure in Transformers

Romain Menegaux, Emmanuel Jehanno, Margot Selosse, Julien, Mairal

PDF

Open Access 1 Repo

TL;DR

This paper introduces CSA, a novel self-attention mechanism for graph transformers that encodes structural and topological information without message passing, achieving state-of-the-art results on molecular datasets.

Contribution

The paper presents CSA, a new self-attention method that modulates feature channels and encodes graph structure using random walk-based positional encoding, bypassing message passing.

Findings

01

Achieves state-of-the-art results on ZINC dataset.

02

Effectively encodes higher-order topological features.

03

Provides a flexible framework for graph structure encoding.

Abstract

We introduce a novel self-attention mechanism, which we call CSA (Chromatic Self-Attention), which extends the notion of attention scores to attention _filters_, independently modulating the feature channels. We showcase CSA in a fully-attentional graph Transformer CGT (Chromatic Graph Transformer) which integrates both graph structural information and edge features, completely bypassing the need for local message-passing components. Our method flexibly encodes graph structure through node-node interactions, by enriching the original edge features with a relative positional encoding scheme. We propose a new scheme based on random walks that encodes both structural and positional information, and show how to incorporate higher-order topological information, such as rings in molecular graphs. Our approach achieves state-of-the-art results on the ZINC benchmark dataset, while providing a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

inria-thoth/csa
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning in Materials Science · Computational Drug Discovery Methods · Advanced Graph Neural Networks

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Adam · Dense Connections · Label Smoothing · Dropout · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Residual Connection