Multi-Layer Dense Attention Decoder for Polyp Segmentation
Krushi Patel, Fengjun Li, Guanghui Wang

TL;DR
This paper introduces a novel multi-layer dense attention decoder with a Dense Attention Gate for improved polyp segmentation, addressing local relation learning and feature aggregation issues in vision Transformer-based models.
Contribution
It proposes a hierarchical dense decoder with a Dense Attention Gate module, enhancing local feature relations and semantic feature aggregation for polyp segmentation.
Findings
Achieves state-of-the-art performance on five polyp datasets.
Outperforms nine competing models on four datasets.
Demonstrates robustness across diverse polyp variations.
Abstract
Detecting and segmenting polyps is crucial for expediting the diagnosis of colon cancer. This is a challenging task due to the large variations of polyps in color, texture, and lighting conditions, along with subtle differences between the polyp and its surrounding area. Recently, vision Transformers have shown robust abilities in modeling global context for polyp segmentation. However, they face two major limitations: the inability to learn local relations among multi-level layers and inadequate feature aggregation in the decoder. To address these issues, we propose a novel decoder architecture aimed at hierarchically aggregating locally enhanced multi-level dense features. Specifically, we introduce a novel module named Dense Attention Gate (DAG), which adaptively fuses all previous layers' features to establish local feature relations among all layers. Furthermore, we propose a novel…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVehicle License Plate Recognition · Handwritten Text Recognition Techniques
Methodsfast speak--How do I Speak to someone at Expedia? · Attention Is All You Need · Linear Layer · Multi-Head Attention · Softmax · Dense Connections · Layer Normalization · Absolute Position Encodings · Spatial-Reduction Attention · Residual Connection
