Enabling Small Models for Zero-Shot Selection and Reuse through Model Label Learning
Jia Zhang, Zhi Zhou, Lan-Zhe Guo, Yu-Feng Li

TL;DR
This paper introduces Model Label Learning (MLL), a scalable approach that enables small models to perform zero-shot task selection and reuse by aligning models with their functionalities through a semantic graph.
Contribution
The paper proposes MLL and CHCO algorithms to effectively select and reuse models for new tasks, bridging the gap between expert and foundation models.
Findings
MLL improves zero-shot task performance using a model hub.
CHCO effectively selects models for new tasks.
Experiments validate MLL's scalability and effectiveness.
Abstract
Vision-language models (VLMs) like CLIP have demonstrated impressive zero-shot ability in image classification tasks by aligning text and images but suffer inferior performance compared with task-specific expert models. On the contrary, expert models excel in their specialized domains but lack zero-shot ability for new tasks. How to obtain both the high performance of expert models and zero-shot ability is an important research direction. In this paper, we attempt to demonstrate that by constructing a model hub and aligning models with their functionalities using model labels, new tasks can be solved in a zero-shot manner by effectively selecting and reusing models in the hub. We introduce a novel paradigm, Model Label Learning (MLL), which bridges the gap between models and their functionalities through a Semantic Directed Acyclic Graph (SDAG) and leverages an algorithm, Classification…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification
MethodsContrastive Language-Image Pre-training
