Low-Rank Continual Pyramid Vision Transformer: Incrementally Segment   Whole-Body Organs in CT with Light-Weighted Adaptation

Vince Zhu; Zhanghexuan Ji; Dazhou Guo; Puyang Wang; Yingda Xia; Le Lu,; Xianghua Ye; Wei Zhu; Dakai Jin

arXiv:2410.04689·cs.CV·October 8, 2024

Low-Rank Continual Pyramid Vision Transformer: Incrementally Segment Whole-Body Organs in CT with Light-Weighted Adaptation

Vince Zhu, Zhanghexuan Ji, Dazhou Guo, Puyang Wang, Yingda Xia, Le Lu,, Xianghua Ye, Wei Zhu, Dakai Jin

PDF

Open Access

TL;DR

This paper introduces a low-rank adaptation method for continual whole-body organ segmentation in CT scans, enabling incremental learning without catastrophic forgetting and with minimal parameter growth.

Contribution

It proposes a light-weighted LoRA approach applied to a frozen PVT model, identifying key layers for effective adaptation in continual segmentation tasks.

Findings

01

Achieves high segmentation accuracy close to upper bounds.

02

Outperforms regularization-based CSS methods.

03

Maintains low parameter growth during continual learning.

Abstract

Deep segmentation networks achieve high performance when trained on specific datasets. However, in clinical practice, it is often desirable that pretrained segmentation models can be dynamically extended to enable segmenting new organs without access to previous training datasets or without training from scratch. This would ensure a much more efficient model development and deployment paradigm accounting for the patient privacy and data storage issues. This clinically preferred process can be viewed as a continual semantic segmentation (CSS) problem. Previous CSS works would either experience catastrophic forgetting or lead to unaffordable memory costs as models expand. In this work, we propose a new continual whole-body organ segmentation model with light-weighted low-rank adaptation (LoRA). We first train and freeze a pyramid vision transformer (PVT) base segmentation model on the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced X-ray and CT Imaging · Medical Imaging Techniques and Applications · Radiomics and Machine Learning in Medical Imaging

MethodsAttention Is All You Need · Linear Layer · Spatial-Reduction Attention · Absolute Position Encodings · Vision Transformer · Softmax · Multi-Head Attention · Layer Normalization · Dense Connections · Balanced Selection