ACC-UNet: A Completely Convolutional UNet model for the 2020s
Nabil Ibtehaz, Daisuke Kihara

TL;DR
ACC-UNet is a fully convolutional neural network that emulates transformer advantages like long-range dependencies, achieving state-of-the-art medical image segmentation performance with fewer parameters.
Contribution
The paper introduces ACC-UNet, a novel convolutional UNet that incorporates transformer-inspired features without using transformers, outperforming existing models.
Findings
Outperforms state-of-the-art models Swin-Unet and UCTransNet in dice score.
Uses significantly fewer parameters than competing models.
Consistently outperforms convnets, transformers, and hybrids across five benchmarks.
Abstract
This decade is marked by the introduction of Vision Transformer, a radical paradigm shift in broad computer vision. A similar trend is followed in medical imaging, UNet, one of the most influential architectures, has been redesigned with transformers. Recently, the efficacy of convolutional models in vision is being reinvestigated by seminal works such as ConvNext, which elevates a ResNet to Swin Transformer level. Deriving inspiration from this, we aim to improve a purely convolutional UNet model so that it can be on par with the transformer-based models, e.g, Swin-Unet or UCTransNet. We examined several advantages of the transformer-based UNet models, primarily long-range dependencies and cross-level skip connections. We attempted to emulate them through convolution operations and thus propose, ACC-UNet, a completely convolutional UNet model that brings the best of both worlds, the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCOVID-19 diagnosis using AI · Advanced Neural Network Applications · AI in cancer detection
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Multi-Head Attention · Attention Is All You Need · Average Pooling · Sigmoid Activation · 1x1 Convolution · Instance Normalization · Linear Layer · Global Average Pooling · Softmax
