Surface Vision Transformers: Flexible Attention-Based Modelling of   Biomedical Surfaces

Simon Dahan; Hao Xu; Logan Z. J. Williams; Abdulah Fawaz; Chunhui; Yang; Timothy S. Coalson; Michelle C. Williams; David E. Newby; A. David; Edwards; Matthew F. Glasser; Alistair A. Young; Daniel Rueckert; Emma C.; Robinson

arXiv:2204.03408·eess.IV·April 8, 2022·1 cites

Surface Vision Transformers: Flexible Attention-Based Modelling of Biomedical Surfaces

Simon Dahan, Hao Xu, Logan Z. J. Williams, Abdulah Fawaz, Chunhui, Yang, Timothy S. Coalson, Michelle C. Williams, David E. Newby, A. David, Edwards, Matthew F. Glasser, Alistair A. Young, Daniel Rueckert, Emma C., Robinson

PDF

Open Access 1 Repo

TL;DR

This paper introduces Surface Vision Transformers (SiT), a novel approach extending Vision Transformers to biomedical surfaces, demonstrating improved predictive performance and interpretability across various clinical tasks.

Contribution

It reformulates surface learning as sequence-to-sequence modeling with patching mechanisms, enabling ViTs to process complex biomedical surface data effectively.

Findings

01

SiT outperforms geometric deep learning in brain age and fluid intelligence prediction.

02

Achieves comparable performance to clinical metrics in calcium score classification.

03

Attention maps provide interpretable insights into task-specific features.

Abstract

Recent state-of-the-art performances of Vision Transformers (ViT) in computer vision tasks demonstrate that a general-purpose architecture, which implements long-range self-attention, could replace the local feature learning operations of convolutional neural networks. In this paper, we extend ViTs to surfaces by reformulating the task of surface learning as a sequence-to-sequence learning problem, by proposing patching mechanisms for general surface meshes. Sequences of patches are then processed by a transformer encoder and used for classification or regression. We validate our method on a range of different biomedical surface domains and tasks: brain age prediction in the developing Human Connectome Project (dHCP), fluid intelligence prediction in the Human Connectome Project (HCP), and coronary artery calcium score classification using surfaces from the Scottish Computed Tomography…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

metrics-lab/surface-vision-transformers
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRetinal Imaging and Analysis · Artificial Intelligence in Healthcare and Education · Age of Information Optimization