TL;DR
DoubleU-Net is a novel deep learning architecture that stacks two U-Net models with enhanced features, significantly improving medical image segmentation accuracy across diverse datasets and imaging modalities.
Contribution
The paper introduces DoubleU-Net, a new architecture combining two U-Nets with ASPP and a pre-trained encoder, achieving superior segmentation performance over existing models.
Findings
Outperforms U-Net and baseline models on multiple datasets.
Achieves higher accuracy in challenging segmentation tasks.
Demonstrates strong generalizability across different medical imaging modalities.
Abstract
Semantic image segmentation is the process of labeling each pixel of an image with its corresponding class. An encoder-decoder based approach, like U-Net and its variants, is a popular strategy for solving medical image segmentation tasks. To improve the performance of U-Net on various segmentation tasks, we propose a novel architecture called DoubleU-Net, which is a combination of two U-Net architectures stacked on top of each other. The first U-Net uses a pre-trained VGG-19 as the encoder, which has already learned features from ImageNet and can be transferred to another task easily. To capture more semantic information efficiently, we added another U-Net at the bottom. We also adopt Atrous Spatial Pyramid Pooling (ASPP) to capture contextual information within the network. We have evaluated DoubleU-Net using four medical segmentation datasets, covering various imaging modalities such…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsVisual Geometry Group 19 Layer CNN · Concatenated Skip Connection · *Communicated@Fast*How Do I Communicate to Expedia? · Max Pooling · Convolution · U-Net · Spatial Pyramid Pooling
