Med-URWKV: Pure RWKV With ImageNet Pre-training For Medical Image Segmentation
Zhenhuan Zhou

TL;DR
This paper introduces Med-URWKV, a pure RWKV-based medical image segmentation model that leverages ImageNet pretraining, demonstrating improved performance over models trained from scratch and highlighting the benefits of pretraining in this domain.
Contribution
We propose Med-URWKV, the first pure RWKV segmentation model in medical imaging that utilizes ImageNet pretraining to enhance segmentation accuracy.
Findings
Achieves comparable or superior performance to models trained from scratch.
Validates the effectiveness of pretraining VRWKV encoders for medical segmentation.
Demonstrates strong long-range modeling with linear complexity.
Abstract
Medical image segmentation is a fundamental and key technology in computer-aided diagnosis and treatment. Previous methods can be broadly classified into three categories: convolutional neural network (CNN) based, Transformer based, and hybrid architectures that combine both. However, each of them has its own limitations, such as restricted receptive fields in CNNs or the computational overhead caused by the quadratic complexity of Transformers. Recently, the Receptance Weighted Key Value (RWKV) model has emerged as a promising alternative for various vision tasks, offering strong long-range modeling capabilities with linear computational complexity. Some studies have also adapted RWKV to medical image segmentation tasks, achieving competitive performance. However, most of these studies focus on modifications to the Vision-RWKV (VRWKV) mechanism and train models from scratch, without…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Medical Imaging and Analysis · AI in cancer detection
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Absolute Position Encodings · Layer Normalization · Max Pooling · Byte Pair Encoding · Label Smoothing · Softmax · Dropout · Dense Connections · Transformer
