Self Pre-training with Topology- and Spatiality-aware Masked Autoencoders for 3D Medical Image Segmentation

Pengfei Gu; Huimin Li; Yejia Zhang; Chaoli Wang; and Danny Z. Chen

arXiv:2406.10519·cs.CV·November 20, 2025

Self Pre-training with Topology- and Spatiality-aware Masked Autoencoders for 3D Medical Image Segmentation

Pengfei Gu, Huimin Li, Yejia Zhang, Chaoli Wang, and Danny Z. Chen

PDF

Open Access

TL;DR

This paper introduces a novel self pre-training method for 3D medical image segmentation that incorporates topology- and spatial-awareness into Masked Autoencoders, improving geometric and spatial feature learning.

Contribution

It proposes a topological loss and a spatial pre-text task to enhance MAE pre-training for 3D medical images, extending the architecture with a hybrid segmentation model.

Findings

01

Improved segmentation accuracy on five public datasets.

02

Effective preservation of geometric shape information.

03

Enhanced spatial feature aggregation in pre-training.

Abstract

Masked Autoencoders (MAEs) have been shown to be effective in pre-training Vision Transformers (ViTs) for natural and medical image analysis problems. By reconstructing missing pixel/voxel information in visible patches, a ViT encoder can aggregate contextual information for downstream tasks. But, existing MAE pre-training methods, which were specifically developed with the ViT architecture, lack the ability to capture geometric shape and spatial information, which is critical for medical image segmentation tasks. In this paper, we propose a novel extension of known MAEs for self pre-training (i.e., models pre-trained on the same target dataset) for 3D medical image segmentation. (1) We propose a new topological loss to preserve geometric shape information by computing topological signatures of both the input and reconstructed volumes, learning geometric shape information. (2) We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRadiomics and Machine Learning in Medical Imaging · AI in cancer detection · Medical Image Segmentation Techniques

MethodsMasked autoencoder