Self-Calibrated Cross Attention Network for Few-Shot Segmentation
Qianxiong Xu, Wenting Zhao, Guosheng Lin, Cheng Long

TL;DR
This paper introduces a self-calibrated cross attention network that improves few-shot segmentation by better aligning and fusing support and query features, especially background features, leading to superior performance on standard benchmarks.
Contribution
The proposed SCCA block with patch alignment and scaled-cosine similarity enhances feature fusion in few-shot segmentation, addressing limitations of previous prototype and cross attention methods.
Findings
Achieves 5.6% higher mIoU on COCO-20^i 5-shot setting.
Outperforms previous state-of-the-art methods on PASCAL-5^i and COCO-20^i.
Demonstrates effective support-query feature alignment and fusion.
Abstract
The key to the success of few-shot segmentation (FSS) lies in how to effectively utilize support samples. Most solutions compress support foreground (FG) features into prototypes, but lose some spatial details. Instead, others use cross attention to fuse query features with uncompressed support FG. Query FG could be fused with support FG, however, query background (BG) cannot find matched BG features in support FG, yet inevitably integrates dissimilar features. Besides, as both query FG and BG are combined with support FG, they get entangled, thereby leading to ineffective segmentation. To cope with these issues, we design a self-calibrated cross attention (SCCA) block. For efficient patch-based attention, query and support features are firstly split into patches. Then, we design a patch alignment module to align each query patch with its most similar support patch for better cross…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Medical Image Segmentation Techniques
MethodsALIGN
