Few-Shot Segmentation via Cycle-Consistent Transformer
Gengwei Zhang, Guoliang Kang, Yi Yang, Yunchao Wei

TL;DR
This paper introduces CyCTR, a novel cycle-consistent transformer that leverages pixel-wise support-query relationships for improved few-shot segmentation, significantly outperforming previous methods on standard benchmarks.
Contribution
The paper proposes a cycle-consistent attention mechanism within a transformer framework to better utilize pixel-wise support information for few-shot segmentation.
Findings
Achieves 67.5% mIoU on Pascal-5^i for 5-shot segmentation.
Achieves 45.6% mIoU on COCO-20^i for 5-shot segmentation.
Outperforms previous state-of-the-art methods by 5.6% and 7.1%.
Abstract
Few-shot segmentation aims to train a segmentation model that can fast adapt to novel classes with few exemplars. The conventional training paradigm is to learn to make predictions on query images conditioned on the features from support images. Previous methods only utilized the semantic-level prototypes of support images as conditional information. These methods cannot utilize all pixel-wise support information for the query predictions, which is however critical for the segmentation task. In this paper, we focus on utilizing pixel-wise relationships between support and query images to facilitate the few-shot segmentation task. We design a novel Cycle-Consistent TRansformer (CyCTR) module to aggregate pixel-wise support features into query ones. CyCTR performs cross-attention between features from different images, i.e. support and query images. We observe that there may exist…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Multimodal Machine Learning Applications
MethodsAttention Is All You Need · Linear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Adam · Label Smoothing · Residual Connection · Dense Connections · Softmax · Multi-Head Attention
