Surface Vision Transformers: Flexible Attention-Based Modelling of Biomedical Surfaces
Simon Dahan, Hao Xu, Logan Z. J. Williams, Abdulah Fawaz, Chunhui, Yang, Timothy S. Coalson, Michelle C. Williams, David E. Newby, A. David, Edwards, Matthew F. Glasser, Alistair A. Young, Daniel Rueckert, Emma C., Robinson

TL;DR
This paper introduces Surface Vision Transformers (SiT), a novel approach extending Vision Transformers to biomedical surfaces, demonstrating improved predictive performance and interpretability across various clinical tasks.
Contribution
It reformulates surface learning as sequence-to-sequence modeling with patching mechanisms, enabling ViTs to process complex biomedical surface data effectively.
Findings
SiT outperforms geometric deep learning in brain age and fluid intelligence prediction.
Achieves comparable performance to clinical metrics in calcium score classification.
Attention maps provide interpretable insights into task-specific features.
Abstract
Recent state-of-the-art performances of Vision Transformers (ViT) in computer vision tasks demonstrate that a general-purpose architecture, which implements long-range self-attention, could replace the local feature learning operations of convolutional neural networks. In this paper, we extend ViTs to surfaces by reformulating the task of surface learning as a sequence-to-sequence learning problem, by proposing patching mechanisms for general surface meshes. Sequences of patches are then processed by a transformer encoder and used for classification or regression. We validate our method on a range of different biomedical surface domains and tasks: brain age prediction in the developing Human Connectome Project (dHCP), fluid intelligence prediction in the Human Connectome Project (HCP), and coronary artery calcium score classification using surfaces from the Scottish Computed Tomography…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRetinal Imaging and Analysis · Artificial Intelligence in Healthcare and Education · Age of Information Optimization
