Multi-dimension unified Swin Transformer for 3D Lesion Segmentation in Multiple Anatomical Locations
Shaoyan Pan, Yiqiao Liu, Sarah Halek, Michal Tomaszewski, Shubing, Wang, Richard Baumgartner, Jianda Yuan, Gregory Goldmacher, Antong Chen

TL;DR
This paper introduces a multi-dimension unified Swin Transformer model for 3D lesion segmentation, leveraging unlabeled data and multi-stage fine-tuning to improve accuracy across various anatomical locations.
Contribution
The novel MDU-ST model combines a Swin-transformer encoder with CNN decoder, enabling effective learning from unlabeled 3D data and multi-level fine-tuning for improved segmentation.
Findings
Significant improvement in Dice similarity coefficient over competing models
Effective utilization of unlabeled 3D volumes through self-supervised learning
Robust segmentation performance across multiple anatomical locations
Abstract
In oncology research, accurate 3D segmentation of lesions from CT scans is essential for the modeling of lesion growth kinetics. However, following the RECIST criteria, radiologists routinely only delineate each lesion on the axial slice showing the largest transverse area, and delineate a small number of lesions in 3D for research purposes. As a result, we have plenty of unlabeled 3D volumes and labeled 2D images, and scarce labeled 3D volumes, which makes training a deep-learning 3D segmentation model a challenging task. In this work, we propose a novel model, denoted a multi-dimension unified Swin transformer (MDU-ST), for 3D lesion segmentation. The MDU-ST consists of a Shifted-window transformer (Swin-transformer) encoder and a convolutional neural network (CNN) decoder, allowing it to adapt to 2D and 3D inputs and learn the corresponding semantic information in the same encoder.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRadiomics and Machine Learning in Medical Imaging · Medical Imaging and Analysis · AI in cancer detection
MethodsMulti-Head Attention · Attention Is All You Need · Softmax · Stochastic Depth · Linear Layer · Dense Connections · Residual Connection · Layer Normalization · Swin Transformer
