Adaptive Cross-Modal Few-Shot Learning
Chen Xing, Negar Rostamzadeh, Boris N. Oreshkin, Pedro O. Pinheiro

TL;DR
This paper introduces an adaptive cross-modal approach for few-shot learning that combines visual and semantic information to improve classification accuracy, especially in low-shot scenarios.
Contribution
It proposes a novel mechanism that adaptively fuses visual and semantic features based on the new categories, outperforming existing uni-modal and modality-alignment methods.
Findings
Outperforms current uni-modality and modality-alignment methods on all benchmarks.
Effectively adjusts focus between visual and semantic modalities.
Significant improvements in very low-shot scenarios.
Abstract
Metric-based meta-learning techniques have successfully been applied to few-shot classification problems. In this paper, we propose to leverage cross-modal information to enhance metric-based few-shot learning methods. Visual and semantic feature spaces have different structures by definition. For certain concepts, visual features might be richer and more discriminative than text ones. While for others, the inverse might be true. Moreover, when the support from visual information is limited in image classification, semantic representations (learned from unsupervised text corpora) can provide strong prior knowledge and context to help learning. Based on these two intuitions, we propose a mechanism that can adaptively combine information from both modalities according to new image categories to be learned. Through a series of experiments, we show that by this adaptive combination of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Cancer-related molecular mechanisms research
