Baby steps towards few-shot learning with multiple semantics
Eli Schwartz, Leonid Karlinsky, Rogerio Feris, Raja Giryes, Alex M., Bronstein

TL;DR
This paper explores how combining multiple types of semantic information, such as labels, attributes, and natural language descriptions, can enhance few-shot visual learning, achieving improved results on standard benchmarks.
Contribution
It introduces a method that leverages multiple richer semantics to improve few-shot learning performance, surpassing previous state-of-the-art results.
Findings
Combining multiple semantics improves few-shot learning accuracy.
The approach outperforms previous methods on miniImageNet and CUB benchmarks.
Ablation studies reveal the importance of each semantic component.
Abstract
Learning from one or few visual examples is one of the key capabilities of humans since early infancy, but is still a significant challenge for modern AI systems. While considerable progress has been achieved in few-shot learning from a few image examples, much less attention has been given to the verbal descriptions that are usually provided to infants when they are presented with a new object. In this paper, we focus on the role of additional semantics that can significantly facilitate few-shot visual learning. Building upon recent advances in few-shot learning with additional semantic information, we demonstrate that further improvements are possible by combining multiple and richer semantics (category labels, attributes, and natural language descriptions). Using these ideas, we offer the community new results on the popular miniImageNet and CUB few-shot benchmarks, comparing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Advanced Image and Video Retrieval Techniques
