nnY-Net: Swin-NeXt with Cross-Attention for 3D Medical Images Segmentation
Haixu Liu, Zerui Tao, Wenzhen Dong, Qiuzhuang Sun

TL;DR
This paper introduces nnY-Net, a novel 3D medical image segmentation model combining Swin Transformer and ConvNeXt with cross-attention, enhancing feature integration and training efficiency.
Contribution
The paper proposes nnY-Net, integrating Swin Transformer and ConvNeXt with cross-attention in a Y-structured model for improved 3D medical image segmentation.
Findings
Achieved improved segmentation accuracy on 3D medical images.
Enhanced training efficiency with DiceFocalCELoss.
Simplified data processing in the nnU-net framework.
Abstract
This paper provides a novel 3D medical image segmentation model structure called nnY-Net. This name comes from the fact that our model adds a cross-attention module at the bottom of the U-net structure to form a Y structure. We integrate the advantages of the two latest SOTA models, MedNeXt and SwinUNETR, and use Swin Transformer as the encoder and ConvNeXt as the decoder to innovatively design the Swin-NeXt structure. Our model uses the lowest-level feature map of the encoder as Key and Value and uses patient features such as pathology and treatment information as Query to calculate the attention weights in a Cross Attention module. Moreover, we simplify some pre- and post-processing as well as data enhancement methods in 3D image segmentation based on the dynUnet and nnU-net frameworks. We integrate our proposed Swin-NeXt with Cross-Attention framework into this framework. Last, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMedical Imaging and Analysis · Advanced Neural Network Applications · Medical Image Segmentation Techniques
MethodsMax Pooling · Convolution · Byte Pair Encoding · Linear Layer · Softmax · Dense Connections · *Communicated@Fast*How Do I Communicate to Expedia? · Attention Is All You Need · Absolute Position Encodings · Dropout
