NestedFormer: Nested Modality-Aware Transformer for Brain Tumor Segmentation
Zhaohu Xing, Lequan Yu, Liang Wan, Tong Han, Lei Zhu

TL;DR
NestedFormer introduces a novel transformer-based approach for brain tumor segmentation that explicitly models intra- and inter-modality relationships in multi-modal MRI data, outperforming existing methods.
Contribution
The paper proposes NestedFormer, a new nested modality-aware transformer architecture with a multi-encoder and single-decoder structure for improved multi-modal MRI segmentation.
Findings
Outperforms state-of-the-art on BraTS2020 benchmark
Effective multi-modal fusion via nested attention mechanisms
Demonstrates superior accuracy on MeniSeg dataset
Abstract
Multi-modal MR imaging is routinely used in clinical practice to diagnose and investigate brain tumors by providing rich complementary information. Previous multi-modal MRI segmentation methods usually perform modal fusion by concatenating multi-modal MRIs at an early/middle stage of the network, which hardly explores non-linear dependencies between modalities. In this work, we propose a novel Nested Modality-Aware Transformer (NestedFormer) to explicitly explore the intra-modality and inter-modality relationships of multi-modal MRIs for brain tumor segmentation. Built on the transformer-based multi-encoder and single-decoder structure, we perform nested multi-modal fusion for high-level representations of different modalities and apply modality-sensitive gating (MSG) at lower scales for more effective skip connections. Specifically, the multi-modal fusion is conducted in our proposed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Brain Tumor Detection and Classification
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Absolute Position Encodings · Layer Normalization · Softmax · Adam · Position-Wise Feed-Forward Layer · Dropout · Dense Connections
