Dense Cross-Query-and-Support Attention Weighted Mask Aggregation for Few-Shot Segmentation
Xinyu Shi, Dong Wei, Yu Zhang, Donghuan Lu, Munan Ning, Jiashun Chen,, Kai Ma, and Yefeng Zheng

TL;DR
This paper introduces DCAMA, a novel pixel-wise attention method for few-shot segmentation that fully exploits support information and improves state-of-the-art performance on multiple benchmarks.
Contribution
The paper proposes a dense pixel-wise cross-query-and-support attention mechanism using Transformer architecture for few-shot segmentation, enabling full support information utilization.
Findings
Significant improvements in 1-shot mIoU on PASCAL-5i, COCO-20i, and FSS-1000 benchmarks.
Effective one-pass inference for n-shot segmentation.
Ablation studies confirming the effectiveness of DCAMA design.
Abstract
Research into Few-shot Semantic Segmentation (FSS) has attracted great attention, with the goal to segment target objects in a query image given only a few annotated support images of the target class. A key to this challenging task is to fully utilize the information in the support images by exploiting fine-grained correlations between the query and support images. However, most existing approaches either compressed the support information into a few class-wise prototypes, or used partial support information (e.g., only foreground) at the pixel level, causing non-negligible information loss. In this paper, we propose Dense pixel-wise Cross-query-and-support Attention weighted Mask Aggregation (DCAMA), where both foreground and background support information are fully exploited via multi-level pixel-wise correlations between paired query and support features. Implemented with the scaled…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Medical Image Segmentation Techniques
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Absolute Position Encodings · Dropout · Byte Pair Encoding · Adam · Residual Connection · Label Smoothing · Position-Wise Feed-Forward Layer
