M^2UNet: MetaFormer Multi-scale Upsampling Network for Polyp Segmentation
Quoc-Huy Trinh, Nhat-Tan Bui, Trong-Hieu Nguyen Mau, Minh-Van Nguyen,, Hai-Minh Phan, Minh-Triet Tran, Hai-Dang Nguyen

TL;DR
M^2UNet is a novel polyp segmentation network that combines MetaFormer, CNN, and Transformer elements with multi-scale upsampling to improve feature aggregation and segmentation accuracy.
Contribution
The paper introduces M^2UNet, integrating MetaFormer with a multi-scale upsampling block within a UNet framework for enhanced polyp segmentation.
Findings
Achieved competitive results on five benchmark datasets.
Effectively exploits multi-level features for better segmentation.
Demonstrates the benefit of combining CNN and Transformer features.
Abstract
Polyp segmentation has recently garnered significant attention, and multiple methods have been formulated to achieve commendable outcomes. However, these techniques often confront difficulty when working with the complex polyp foreground and their surrounding regions because of the nature of convolution operation. Besides, most existing methods forget to exploit the potential information from multiple decoder stages. To address this challenge, we suggest combining MetaFormer, introduced as a baseline for integrating CNN and Transformer, with UNet framework and incorporating our Multi-scale Upsampling block (MU). This simple module makes it possible to combine multi-level information by exploring multiple receptive field paths of the shallow decoder stage and then adding with the higher stage to aggregate better feature representation, which is essential in medical image segmentation.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRadiomics and Machine Learning in Medical Imaging · Colorectal Cancer Screening and Detection · AI in cancer detection
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Position-Wise Feed-Forward Layer · Label Smoothing · Absolute Position Encodings · Layer Normalization · Residual Connection · Softmax · Byte Pair Encoding
