Few-Shot Domain Adaptation with Polymorphic Transformers
Shaohua Li, Xiuchao Sui, Jie Fu, Huazhu Fu, Xiangde Luo, Yangqin Feng,, Xinxing Xu, Yong Liu, Daniel Ting, Rick Siow Mong Goh

TL;DR
This paper introduces Polyformer, a polymorphic transformer module that enables effective few-shot domain adaptation in medical image segmentation by leveraging prototype embeddings and minimal parameter updates.
Contribution
The paper proposes a novel Polyformer layer that facilitates few-shot domain adaptation by extracting source prototypes and updating only a projection layer, reducing overfitting and improving robustness.
Findings
Polyformer improves segmentation accuracy on unseen domains.
Effective with minimal annotated data in target domain.
Reduces overfitting by freezing most model weights during adaptation.
Abstract
Deep neural networks (DNNs) trained on one set of medical images often experience severe performance drop on unseen test images, due to various domain discrepancy between the training images (source domain) and the test images (target domain), which raises a domain adaptation issue. In clinical settings, it is difficult to collect enough annotated target domain data in a short period. Few-shot domain adaptation, i.e., adapting a trained model with a handful of annotations, is highly practical and useful in this case. In this paper, we propose a Polymorphic Transformer (Polyformer), which can be incorporated into any DNN backbones for few-shot domain adaptation. Specifically, after the polyformer layer is inserted into a model trained on the source domain, it extracts a set of prototype embeddings, which can be viewed as a "basis" of the source-domain features. On the target domain, the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · AI in cancer detection · Fetal and Pediatric Neurological Disorders
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Adam · Label Smoothing · Residual Connection · Dense Connections · Softmax
