Self Pre-training with Adaptive Mask Autoencoders for Variable-Contrast 3D Medical Imaging

Badhan Kumar Das; Gengyan Zhao; Han Liu; Thomas J. Re; Dorin Comaniciu; Eli Gibson; and Andreas Maier

arXiv:2501.09096·eess.IV·June 3, 2025

Self Pre-training with Adaptive Mask Autoencoders for Variable-Contrast 3D Medical Imaging

Badhan Kumar Das, Gengyan Zhao, Han Liu, Thomas J. Re, Dorin Comaniciu, Eli Gibson, and Andreas Maier

PDF

Open Access

TL;DR

This paper introduces a 3D adaptive masked autoencoder architecture for pre-training Vision Transformers on variable-contrast MRI data, improving infarct segmentation accuracy in medical imaging.

Contribution

It proposes a novel adaptive Masked Autoencoder model that handles varying numbers of input contrasts in 3D medical images, enhancing pre-training effectiveness.

Findings

01

Pretraining with AMAE improves infarct segmentation by 2.8%-3.7%.

02

The method effectively handles variable input contrasts in MRI data.

03

Self pre-training enhances model performance in medical image analysis.

Abstract

The Masked Autoencoder (MAE) has recently demonstrated effectiveness in pre-training Vision Transformers (ViT) for analyzing natural images. By reconstructing complete images from partially masked inputs, the ViT encoder gathers contextual information to predict the missing regions. This capability to aggregate context is especially important in medical imaging, where anatomical structures are functionally and mechanically linked to surrounding regions. However, current methods do not consider variations in the number of input images, which is typically the case in real-world Magnetic Resonance (MR) studies. To address this limitation, we propose a 3D Adaptive Masked Autoencoders (AMAE) architecture that accommodates a variable number of 3D input contrasts per subject. A magnetic resonance imaging (MRI) dataset of 45,364 subjects was used for pretraining and a subset of 1648 training,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMedical Image Segmentation Techniques · 3D Shape Modeling and Analysis · Radiomics and Machine Learning in Medical Imaging