Self-Supervised Pre-Training of Swin Transformers for 3D Medical Image Analysis
Yucheng Tang, Dong Yang, Wenqi Li, Holger Roth, Bennett Landman,, Daguang Xu, Vishwesh Nath, Ali Hatamizadeh

TL;DR
This paper introduces a novel 3D transformer-based model, Swin UNETR, with tailored self-supervised proxy tasks for medical image analysis, achieving state-of-the-art results on multiple segmentation benchmarks.
Contribution
The paper presents a new hierarchical 3D transformer model and specific proxy tasks for self-supervised learning in medical imaging, improving transfer learning performance.
Findings
Pre-trained on 5,050 CT images from various organs.
Achieved first place on MSD and BTCV segmentation leaderboards.
Demonstrated superior transfer learning effectiveness in medical image segmentation.
Abstract
Vision Transformers (ViT)s have shown great performance in self-supervised learning of global and local representations that can be transferred to downstream applications. Inspired by these results, we introduce a novel self-supervised learning framework with tailored proxy tasks for medical image analysis. Specifically, we propose: (i) a new 3D transformer-based model, dubbed Swin UNEt TRansformers (Swin UNETR), with a hierarchical encoder for self-supervised pre-training; (ii) tailored proxy tasks for learning the underlying pattern of human anatomy. We demonstrate successful pre-training of the proposed model on 5,050 publicly available computed tomography (CT) images from various body organs. The effectiveness of our approach is validated by fine-tuning the pre-trained models on the Beyond the Cranial Vault (BTCV) Segmentation Challenge with 13 abdominal organs and segmentation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗osanseviero/swin_unetr_btcv_segmentationmodel· ♡ 1♡ 1
- 🤗monai-test/swin_unetr_btcv_segmentationmodel· ♡ 1♡ 1
- 🤗AnonRes/PrimusM-OpenMind-MAEmodel· 1 dl1 dl
- 🤗AnonRes/ResEncL-OpenMind-MAEmodel· 19 dl· ♡ 119 dl♡ 1
- 🤗AnonRes/ResEncL-OpenMind-S3Dmodel· 11 dl11 dl
- 🤗AnonRes/ResEncL-OpenMind-VFmodel· 6 dl6 dl
- 🤗AnonRes/ResEncL-OpenMind-VoComodel· 6 dl6 dl
- 🤗AnonRes/ResEncL-OpenMind-MGmodel
- 🤗AnonRes/ResEncL-OpenMind-SimCLRmodel· 2 dl2 dl
- 🤗AnonRes/ResEncL-OpenMind-SwinUNETRmodel
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMedical Imaging and Analysis · Radiomics and Machine Learning in Medical Imaging · AI in cancer detection
