Combining Self-Training and Hybrid Architecture for Semi-supervised Abdominal Organ Segmentation
Wentao Liu, Weijin Xu, Songlin Yan, Lemeng Wang, Haoyuan Li, Huihua, Yang

TL;DR
This paper presents a semi-supervised abdominal organ segmentation method combining self-training with a hybrid CNN and Swin Transformer architecture, achieving high accuracy and efficiency on FLARE2022 data.
Contribution
It introduces a novel two-stage segmentation framework using a hybrid architecture for pseudo-label generation and training, improving performance and resource efficiency.
Findings
Achieved DSC of 0.8956 and NSD of 0.9316 on validation set.
Demonstrated fast inference time of 18.62 seconds.
Utilized low GPU memory of approximately 2000 MB.
Abstract
Abdominal organ segmentation has many important clinical applications, such as organ quantification, surgical planning, and disease diagnosis. However, manually annotating organs from CT scans is time-consuming and labor-intensive. Semi-supervised learning has shown the potential to alleviate this challenge by learning from a large set of unlabeled images and limited labeled samples. In this work, we follow the self-training strategy and employ a high-performance hybrid architecture (PHTrans) consisting of CNN and Swin Transformer for the teacher model to generate precise pseudo labels for unlabeled data. Afterward, we introduce them with labeled data together into a two-stage segmentation framework with lightweight PHTrans for training to improve the performance and generalization ability of the model while remaining efficient. Experiments on the validation set of FLARE2022 demonstrate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Radiomics and Machine Learning in Medical Imaging · Medical Imaging and Analysis
MethodsAttention Is All You Need · Linear Layer · Stochastic Depth · Swin Transformer · Position-Wise Feed-Forward Layer · Softmax · Byte Pair Encoding · Adam · Label Smoothing · Dense Connections
