Uni-Encoder Meets Multi-Encoders: Representation Before Fusion for Brain Tumor Segmentation with Missing Modalities

Peibo Song; Xiaotian Xue; Jinshuo Zhang; Zihao Wang; Jinhua Liu; Shujun Fu; Fangxun Bao; Si Yong Yeo

arXiv:2604.22177·cs.CV·April 27, 2026

Uni-Encoder Meets Multi-Encoders: Representation Before Fusion for Brain Tumor Segmentation with Missing Modalities

Peibo Song, Xiaotian Xue, Jinshuo Zhang, Zihao Wang, Jinhua Liu, Shujun Fu, Fangxun Bao, Si Yong Yeo

PDF

1 Repo

TL;DR

This paper introduces UniME, a two-stage architecture combining a unified ViT encoder and modality-specific CNN encoders, to improve brain tumor segmentation with missing MRI modalities, demonstrating superior performance on BraTS datasets.

Contribution

The novel two-stage UniME framework effectively handles missing modalities by decoupling representation learning from segmentation, outperforming prior methods on BraTS benchmarks.

Findings

01

UniME achieves higher segmentation accuracy with missing modalities.

02

The unified representation is robust across different missing modality scenarios.

03

Experimental results on BraTS 2023 and 2024 validate the method's effectiveness.

Abstract

Multimodal MRI offers complementary information for brain tumor segmentation, but clinical scans often lack one or more modalities, which degrades segmentation performance. In this paper, we propose UniME (Uni-Encoder Meets Multi-Encoders), a two-stage heterogeneous method for brain tumor segmentation with missing modalities that reconciles the trade-offs among fine-grained structure capture, cross-modal complementarity modeling, and exploitation of available modalities. The idea is to decouple representation learning from segmentation via a two-stage heterogeneous architecture. Stage 1 pretrains a single ViT Uni-Encoder with masked image modeling to establish a unified representation robust to missing modalities. Stage 2 adds modality-specific CNN Multi-Encoders to extract high-resolution, multi-scale, fine-grained features. We fuse these features with the global representation to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Hooorace-S/UniME
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.