Reference Twice: A Simple and Unified Baseline for Few-Shot Instance Segmentation
Yue Han, Jiangning Zhang, Yabiao Wang, Chengjie Wang, Yong Liu, Lu Qi,, Xiangtai Li, Ming-Hsuan Yang

TL;DR
This paper introduces Reference Twice, a transformer-based unified framework for Few-Shot Instance Segmentation that effectively leverages support-query relationships, avoids overfitting, and extends to incremental learning with improved performance.
Contribution
The paper presents a novel transformer-based baseline for FSIS, simplifies support-query interaction, and proposes a class-enhanced distillation loss for incremental learning.
Findings
Achieves +8.2/+9.4 performance gain over state-of-the-art on COCO.
Effectively avoids overfitting in FSIS.
Enables easy extension to incremental FSIS.
Abstract
Few-Shot Instance Segmentation (FSIS) requires detecting and segmenting novel classes with limited support examples. Existing methods based on Region Proposal Networks (RPNs) face two issues: 1) Overfitting suppresses novel class objects; 2) Dual-branch models require complex spatial correlation strategies to prevent spatial information loss when generating class prototypes. We introduce a unified framework, Reference Twice (RefT), to exploit the relationship between support and query features for FSIS and related tasks. Our three main contributions are: 1) A novel transformer-based baseline that avoids overfitting, offering a new direction for FSIS; 2) Demonstrating that support object queries encode key factors after base training, allowing query features to be enhanced twice at both feature and query levels using simple cross-attention, thus avoiding complex spatial correlation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Human Pose and Action Recognition · Anomaly Detection Techniques and Applications
MethodsBalanced Selection
