SemSup: Semantic Supervision for Simple and Scalable Zero-shot Generalization
Austin W. Hanjie, Ameet Deshpande, Karthik Narasimhan

TL;DR
SemSup introduces a scalable, multi-description, and hybrid approach for zero-shot learning that surpasses previous methods by leveraging diverse description formats and fine-grained similarity, improving generalization across multiple datasets and modalities.
Contribution
SemSup proposes a novel scalable multi-description sampling method, alternative description formats like JSON, and a hybrid lexical-semantic similarity for enhanced zero-shot learning.
Findings
Increases unseen class accuracy by 15 points on average.
Effective across four datasets and two modalities.
Outperforms baseline methods significantly.
Abstract
Zero-shot learning is the problem of predicting instances over classes not seen during training. One approach to zero-shot learning is providing auxiliary class information to the model. Prior work along this vein have largely used expensive per-instance annotation or singular class-level descriptions, but per-instance descriptions are hard to scale and single class descriptions may not be rich enough. Furthermore, these works have used natural-language descriptions exclusively, simple bi-encoders models, and modality or task-specific methods. These approaches have several limitations: text supervision may not always be available or optimal and bi-encoders may only learn coarse relations between inputs and class descriptions. In this work, we present SemSup, a novel approach that uses (1) a scalable multiple description sampling method which improves performance over single…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Machine Learning and Data Classification
