Self Pre-training with Topology- and Spatiality-aware Masked Autoencoders for 3D Medical Image Segmentation
Pengfei Gu, Huimin Li, Yejia Zhang, Chaoli Wang, and Danny Z. Chen

TL;DR
This paper introduces a novel self pre-training method for 3D medical image segmentation that incorporates topology- and spatial-awareness into Masked Autoencoders, improving geometric and spatial feature learning.
Contribution
It proposes a topological loss and a spatial pre-text task to enhance MAE pre-training for 3D medical images, extending the architecture with a hybrid segmentation model.
Findings
Improved segmentation accuracy on five public datasets.
Effective preservation of geometric shape information.
Enhanced spatial feature aggregation in pre-training.
Abstract
Masked Autoencoders (MAEs) have been shown to be effective in pre-training Vision Transformers (ViTs) for natural and medical image analysis problems. By reconstructing missing pixel/voxel information in visible patches, a ViT encoder can aggregate contextual information for downstream tasks. But, existing MAE pre-training methods, which were specifically developed with the ViT architecture, lack the ability to capture geometric shape and spatial information, which is critical for medical image segmentation tasks. In this paper, we propose a novel extension of known MAEs for self pre-training (i.e., models pre-trained on the same target dataset) for 3D medical image segmentation. (1) We propose a new topological loss to preserve geometric shape information by computing topological signatures of both the input and reconstructed volumes, learning geometric shape information. (2) We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRadiomics and Machine Learning in Medical Imaging · AI in cancer detection · Medical Image Segmentation Techniques
MethodsMasked autoencoder
