Continual Hippocampus Segmentation with Transformers

Amin Ranem; Camila Gonz\'alez; Anirban Mukhopadhyay

arXiv:2204.08043·eess.IV·April 19, 2022·1 cites

Continual Hippocampus Segmentation with Transformers

Amin Ranem, Camila Gonz\'alez, Anirban Mukhopadhyay

PDF

Open Access 1 Repo

TL;DR

This paper investigates the use of Transformer-based models for continual hippocampus segmentation, demonstrating their potential to reduce catastrophic forgetting in medical image analysis compared to traditional convolutional models.

Contribution

It provides an analysis of Transformer mechanisms in sequential learning for medical segmentation and offers insights on adapting continual learning strategies for these models.

Findings

01

Transformers mitigate catastrophic forgetting better than convolutional models.

02

Regularising ViT modules requires careful consideration.

03

Transformer mechanisms improve robustness in sequential hippocampus segmentation.

Abstract

In clinical settings, where acquisition conditions and patient populations change over time, continual learning is key for ensuring the safe use of deep neural networks. Yet most existing work focuses on convolutional architectures and image classification. Instead, radiologists prefer to work with segmentation models that outline specific regions-of-interest, for which Transformer-based architectures are gaining traction. The self-attention mechanism of Transformers could potentially mitigate catastrophic forgetting, opening the way for more robust medical image segmentation. In this work, we explore how recently-proposed Transformer mechanisms for semantic segmentation behave in sequential learning scenarios, and analyse how best to adapt continual learning strategies for this setting. Our evaluation on hippocampus segmentation shows that Transformer mechanisms mitigate catastrophic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

MECLabTUDA/Lifelong-nnUNet
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Artificial Intelligence in Healthcare and Education · Radiomics and Machine Learning in Medical Imaging

MethodsAttention Is All You Need · Linear Layer · Softmax · Dropout · Label Smoothing · Adam · Multi-Head Attention · Residual Connection · Absolute Position Encodings · Byte Pair Encoding