A Universal Representation Transformer Layer for Few-Shot Image Classification
Lu Liu, William Hamilton, Guodong Long, Jing Jiang, Hugo Larochelle

TL;DR
This paper introduces a Universal Representation Transformer layer that meta-learns to effectively combine diverse domain-specific features, achieving state-of-the-art results in multi-domain few-shot image classification.
Contribution
The paper proposes the URT layer, a novel meta-learning module that dynamically re-weights and composes features from multiple domains for improved few-shot classification.
Findings
URT achieves state-of-the-art on Meta-Dataset.
URT performs well across many data sources.
Visualization shows effective cross-domain generalization.
Abstract
Few-shot classification aims to recognize unseen classes when presented with only a small number of samples. We consider the problem of multi-domain few-shot image classification, where unseen classes and examples come from diverse data sources. This problem has seen growing interest and has inspired the development of benchmarks such as Meta-Dataset. A key challenge in this multi-domain setting is to effectively integrate the feature representations from the diverse set of training domains. Here, we propose a Universal Representation Transformer (URT) layer, that meta-learns to leverage universal features for few-shot classification by dynamically re-weighting and composing the most appropriate domain-specific representations. In experiments, we show that URT sets a new state-of-the-art result on Meta-Dataset. Specifically, it achieves top-performance on the highest number of data…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Machine Learning and ELM
MethodsLinear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Residual Connection · Label Smoothing · Multi-Head Attention · Adam · *Communicated@Fast*How Do I Communicate to Expedia? · Dropout · Byte Pair Encoding
