DoubleU-Net: A Deep Convolutional Neural Network for Medical Image   Segmentation

Debesh Jha; Michael A. Riegler; Dag Johansen; P{\aa}l Halvorsen,; H{\aa}vard D. Johansen

arXiv:2006.04868·eess.IV·June 30, 2020

DoubleU-Net: A Deep Convolutional Neural Network for Medical Image Segmentation

Debesh Jha, Michael A. Riegler, Dag Johansen, P{\aa}l Halvorsen,, H{\aa}vard D. Johansen

PDF

4 Repos

TL;DR

DoubleU-Net is a novel deep learning architecture that stacks two U-Net models with enhanced features, significantly improving medical image segmentation accuracy across diverse datasets and imaging modalities.

Contribution

The paper introduces DoubleU-Net, a new architecture combining two U-Nets with ASPP and a pre-trained encoder, achieving superior segmentation performance over existing models.

Findings

01

Outperforms U-Net and baseline models on multiple datasets.

02

Achieves higher accuracy in challenging segmentation tasks.

03

Demonstrates strong generalizability across different medical imaging modalities.

Abstract

Semantic image segmentation is the process of labeling each pixel of an image with its corresponding class. An encoder-decoder based approach, like U-Net and its variants, is a popular strategy for solving medical image segmentation tasks. To improve the performance of U-Net on various segmentation tasks, we propose a novel architecture called DoubleU-Net, which is a combination of two U-Net architectures stacked on top of each other. The first U-Net uses a pre-trained VGG-19 as the encoder, which has already learned features from ImageNet and can be transferred to another task easily. To capture more semantic information efficiently, we added another U-Net at the bottom. We also adopt Atrous Spatial Pyramid Pooling (ASPP) to capture contextual information within the network. We have evaluated DoubleU-Net using four medical segmentation datasets, covering various imaging modalities such…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsVisual Geometry Group 19 Layer CNN · Concatenated Skip Connection · *Communicated@Fast*How Do I Communicate to Expedia? · Max Pooling · Convolution · U-Net · Spatial Pyramid Pooling