Surface Vision Transformers: Attention-Based Modelling applied to   Cortical Analysis

Simon Dahan; Abdulah Fawaz; Logan Z. J. Williams; Chunhui Yang,; Timothy S. Coalson; Matthew F. Glasser; A. David Edwards; Daniel Rueckert,; Emma C. Robinson

arXiv:2203.16414·cs.CV·June 6, 2024·6 cites

Surface Vision Transformers: Attention-Based Modelling applied to Cortical Analysis

Simon Dahan, Abdulah Fawaz, Logan Z. J. Williams, Chunhui Yang,, Timothy S. Coalson, Matthew F. Glasser, A. David Edwards, Daniel Rueckert,, Emma C. Robinson

PDF

Open Access 1 Repo

TL;DR

This paper introduces Surface Vision Transformers (SiT), a novel attention-based model for analyzing cortical surface data, outperforming traditional surface CNNs and enabling detailed study of brain development patterns.

Contribution

The paper presents a domain-agnostic transformer architecture for surface data, using spherical patches and self-attention, advancing surface analysis beyond convolutional methods.

Findings

01

SiT outperforms surface CNNs in phenotype regression tasks.

02

SiT performs comparably on registered and unregistered data.

03

Attention maps reveal subtle cognitive developmental patterns.

Abstract

The extension of convolutional neural networks (CNNs) to non-Euclidean geometries has led to multiple frameworks for studying manifolds. Many of those methods have shown design limitations resulting in poor modelling of long-range associations, as the generalisation of convolutions to irregular surfaces is non-trivial. Motivated by the success of attention-modelling in computer vision, we translate convolution-free vision transformer approaches to surface data, to introduce a domain-agnostic architecture to study any surface data projected onto a spherical manifold. Here, surface patching is achieved by representing spherical data as a sequence of triangular patches, extracted from a subdivided icosphere. A transformer model encodes the sequence of patches via successive multi-head self-attention layers while preserving the sequence resolution. We validate the performance of the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

metrics-lab/surface-vision-transformers
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Topics3D Shape Modeling and Analysis · Visual Attention and Saliency Detection · Face Recognition and Perception

MethodsAttention Is All You Need · Linear Layer · Softmax · Absolute Position Encodings · Layer Normalization · Dropout · Byte Pair Encoding · Position-Wise Feed-Forward Layer · Dense Connections · Label Smoothing