A Simple and Robust Framework for Cross-Modality Medical Image   Segmentation applied to Vision Transformers

Matteo Bastico; David Ryckelynck; Laurent Cort\'e; Yannick Tillier,; Etienne Decenci\`ere

arXiv:2310.05572·eess.IV·February 28, 2024

A Simple and Robust Framework for Cross-Modality Medical Image Segmentation applied to Vision Transformers

Matteo Bastico, David Ryckelynck, Laurent Cort\'e, Yannick Tillier,, Etienne Decenci\`ere

PDF

Open Access 2 Repos

TL;DR

This paper introduces a simple, robust framework for cross-modality medical image segmentation using a single conditional model with adaptive normalization, significantly improving performance across different imaging modalities.

Contribution

It proposes a novel framework that enables effective cross-modality segmentation with a single model, eliminating the need for registered images or synthetic data generation.

Findings

01

Outperforms existing methods on the Multi-Modality Whole Heart Segmentation Challenge.

02

The Conditional Vision Transformer (C-ViT) encoder improves segmentation accuracy by up to 6.87% Dice.

03

The framework works with non-registered, mixed modality data.

Abstract

When it comes to clinical images, automatic segmentation has a wide variety of applications and a considerable diversity of input domains, such as different types of Magnetic Resonance Images (MRIs) and Computerized Tomography (CT) scans. This heterogeneity is a challenge for cross-modality algorithms that should equally perform independently of the input image type fed to them. Often, segmentation models are trained using a single modality, preventing generalization to other types of input data without resorting to transfer learning techniques. Furthermore, the multi-modal or cross-modality architectures proposed in the literature frequently require registered images, which are not easy to collect in clinical environments, or need additional processing steps, such as synthetic image generation. In this work, we propose a simple framework to achieve fair image segmentation of multiple…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Medical Image Segmentation Techniques · Advanced Image and Video Retrieval Techniques

MethodsMulti-Head Attention · Linear Layer · Position-Wise Feed-Forward Layer · Byte Pair Encoding · Attention Is All You Need · Dense Connections · Vision Transformer · Label Smoothing · Adam · Absolute Position Encodings