Cross-Domain Few-Shot Learning via Adaptive Transformer Networks

Naeem Paeedeh; Mahardhika Pratama; Muhammad Anwar Ma'sum; Wolfgang; Mayer; Zehong Cao; Ryszard Kowlczyk

arXiv:2401.13987·cs.LG·January 26, 2024·1 cites

Cross-Domain Few-Shot Learning via Adaptive Transformer Networks

Naeem Paeedeh, Mahardhika Pratama, Muhammad Anwar Ma'sum, Wolfgang, Mayer, Zehong Cao, Ryszard Kowlczyk

PDF

Open Access 1 Repo

TL;DR

This paper introduces ADAPTER, an adaptive transformer network designed for cross-domain few-shot learning, effectively handling large domain shifts by learning transferable features and improving prediction consistency.

Contribution

It proposes a novel adaptive transformer architecture with bidirectional cross-attention and label smoothing, advancing cross-domain few-shot learning capabilities.

Findings

01

Outperforms prior methods on BSCD-FSL benchmarks

02

Uses DINO for diverse feature extraction

03

Achieves significant performance improvements

Abstract

Most few-shot learning works rely on the same domain assumption between the base and the target tasks, hindering their practical applications. This paper proposes an adaptive transformer network (ADAPTER), a simple but effective solution for cross-domain few-shot learning where there exist large domain shifts between the base task and the target task. ADAPTER is built upon the idea of bidirectional cross-attention to learn transferable features between the two domains. The proposed architecture is trained with DINO to produce diverse, and less biased features to avoid the supervision collapse problem. Furthermore, the label smoothing approach is proposed to improve the consistency and reliability of the predictions by also considering the predicted labels of the close samples in the embedding space. The performance of ADAPTER is rigorously evaluated in the BSCD-FSL benchmarks in which…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

naeem-paeedeh/adapter
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and ELM

MethodsAttention Is All You Need · Softmax · Layer Normalization · Residual Connection · Linear Layer · Multi-Head Attention · Dense Connections · Vision Transformer · Label Smoothing · self-DIstillation with NO labels