TL;DR
TransNorm introduces a novel deep segmentation framework that integrates Transformer modules into U-Net, utilizing a spatial normalization mechanism in skip connections to enhance global context modeling and improve medical image segmentation accuracy.
Contribution
The paper proposes TransNorm, a new Transformer-based U-Net variant with a spatial normalization mechanism in skip connections for better feature fusion and segmentation performance.
Findings
Effective across three medical segmentation tasks
Outperforms traditional U-Net models
Demonstrates improved global context modeling
Abstract
In the past few years, convolutional neural networks (CNNs), particularly U-Net, have been the prevailing technique in the medical image processing era. Specifically, the seminal U-Net, as well as its alternatives, have successfully managed to address a wide variety of medical image segmentation tasks. However, these architectures are intrinsically imperfect as they fail to exhibit long-range interactions and spatial dependencies leading to a severe performance drop in the segmentation of medical images with variable shapes and structures. Transformers, preliminary proposed for sequence-to-sequence prediction, have arisen as surrogate architectures to precisely model global information assisted by the self-attention mechanism. Despite being feasibly designed, utilizing a pure Transformer for image segmentation purposes can result in limited localization capacity stemming from inadequate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsAttention Is All You Need · Linear Layer · Concatenated Skip Connection · Softmax · Convolution · Dense Connections · Dropout · Adam · Byte Pair Encoding · *Communicated@Fast*How Do I Communicate to Expedia?
