MOSformer: Momentum encoder-based inter-slice fusion transformer for medical image segmentation
De-Xing Huang, Xiao-Hu Zhou, Mei-Jiang Gui, Xiao-Liang Xie, Shi-Qi Liu, Shuang-Yi Wang, Zhen-Qiu Feng, Zhi-Chao Lai, Zeng-Guang Hou

TL;DR
MOSformer introduces a dual-encoder transformer with momentum-based inter-slice fusion for improved 2.5D medical image segmentation, achieving state-of-the-art results on multiple benchmarks.
Contribution
The paper proposes MOSformer, a novel inter-slice fusion transformer utilizing dual encoders and a momentum encoder to enhance segmentation accuracy.
Findings
Achieves state-of-the-art DSC scores on three benchmark datasets.
Effectively fuses inter-slice information for better segmentation.
Outperforms existing 2.5D segmentation models.
Abstract
Medical image segmentation takes an important position in various clinical applications. 2.5D-based segmentation models bridge the computational efficiency of 2D-based models with the spatial perception capabilities of 3D-based models. However, existing 2.5D-based models primarily adopt a single encoder to extract features of target and neighborhood slices, failing to effectively fuse inter-slice information, resulting in suboptimal segmentation performance. In this study, a novel momentum encoder-based inter-slice fusion transformer (MOSformer) is proposed to overcome this issue by leveraging inter-slice information from multi-scale feature maps extracted by different encoders. Specifically, dual encoders are employed to enhance feature distinguishability among different slices. One of the encoders is moving-averaged to maintain consistent slice representations. Moreover, an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Medical Image Segmentation Techniques · Radiomics and Machine Learning in Medical Imaging
