Medical Transformer: Universal Brain Encoder for 3D MRI Analysis
Eunji Jun, Seungwoo Jeong, Da-Woon Heo, Heung-Il Suk

TL;DR
This paper introduces Medical Transformer, a parameter-efficient transfer learning framework that models 3D MRI data as sequences of 2D slices, leveraging multi-view information and self-supervised pre-training to improve performance on various brain MRI tasks.
Contribution
The paper presents a novel multi-view Transformer architecture for 3D MRI analysis, pre-trained with self-supervised learning, achieving superior results with fewer parameters.
Findings
Outperforms state-of-the-art transfer learning methods.
Reduces model parameters by up to 92%.
Effective across multiple brain MRI tasks.
Abstract
Transfer learning has gained attention in medical image analysis due to limited annotated 3D medical datasets for training data-driven deep learning models in the real world. Existing 3D-based methods have transferred the pre-trained models to downstream tasks, which achieved promising results with only a small number of training samples. However, they demand a massive amount of parameters to train the model for 3D medical imaging. In this work, we propose a novel transfer learning framework, called Medical Transformer, that effectively models 3D volumetric images in the form of a sequence of 2D image slices. To make a high-level representation in 3D-form empowering spatial relations better, we take a multi-view approach that leverages plenty of information from the three planes of 3D volume, while providing parameter-efficient training. For building a source model generally applicable…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Radiomics and Machine Learning in Medical Imaging · Medical Image Segmentation Techniques
MethodsAttention Is All You Need · Linear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Residual Connection · Softmax · Dropout · Layer Normalization · Multi-Head Attention · Byte Pair Encoding
