Meta-Polyp: a baseline for efficient Polyp segmentation
Quoc-Huy Trinh

TL;DR
Meta-Polyp introduces a novel fusion of Meta-Former with UNet and new blocks to improve polyp segmentation, especially for challenging datasets, achieving top results in state-of-the-art benchmarks.
Contribution
The paper presents a new architecture combining Meta-Former with UNet and novel blocks to enhance global and local feature integration for polyp segmentation.
Findings
Achieved top performance on CVC-300, Kvasir, and CVC-ColonDB datasets.
Improved segmentation of out-of-distribution datasets and small polyps.
Enhanced boundary and texture recognition in medical images.
Abstract
In recent years, polyp segmentation has gained significant importance, and many methods have been developed using CNN, Vision Transformer, and Transformer techniques to achieve competitive results. However, these methods often face difficulties when dealing with out-of-distribution datasets, missing boundaries, and small polyps. In 2022, Meta-Former was introduced as a new baseline for vision, which not only improved the performance of multi-task computer vision but also addressed the limitations of the Vision Transformer and CNN family backbones. To further enhance segmentation, we propose a fusion of Meta-Former with UNet, along with the introduction of a Multi-scale Upsampling block with a level-up combination in the decoder stage to enhance the texture, also we propose the Convformer block base on the idea of the Meta-former to enhance the crucial information of the local feature.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRadiomics and Machine Learning in Medical Imaging · Advanced Neural Network Applications · COVID-19 diagnosis using AI
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Position-Wise Feed-Forward Layer · Residual Connection · Adam · Absolute Position Encodings · Softmax · Layer Normalization · Byte Pair Encoding
