Enabling Small Models for Zero-Shot Selection and Reuse through Model   Label Learning

Jia Zhang; Zhi Zhou; Lan-Zhe Guo; Yu-Feng Li

arXiv:2408.11449·cs.AI·February 4, 2025

Enabling Small Models for Zero-Shot Selection and Reuse through Model Label Learning

Jia Zhang, Zhi Zhou, Lan-Zhe Guo, Yu-Feng Li

PDF

Open Access

TL;DR

This paper introduces Model Label Learning (MLL), a scalable approach that enables small models to perform zero-shot task selection and reuse by aligning models with their functionalities through a semantic graph.

Contribution

The paper proposes MLL and CHCO algorithms to effectively select and reuse models for new tasks, bridging the gap between expert and foundation models.

Findings

01

MLL improves zero-shot task performance using a model hub.

02

CHCO effectively selects models for new tasks.

03

Experiments validate MLL's scalability and effectiveness.

Abstract

Vision-language models (VLMs) like CLIP have demonstrated impressive zero-shot ability in image classification tasks by aligning text and images but suffer inferior performance compared with task-specific expert models. On the contrary, expert models excel in their specialized domains but lack zero-shot ability for new tasks. How to obtain both the high performance of expert models and zero-shot ability is an important research direction. In this paper, we attempt to demonstrate that by constructing a model hub and aligning models with their functionalities using model labels, new tasks can be solved in a zero-shot manner by effectively selecting and reusing models in the hub. We introduce a novel paradigm, Model Label Learning (MLL), which bridges the gap between models and their functionalities through a Semantic Directed Acyclic Graph (SDAG) and leverages an algorithm, Classification…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification

MethodsContrastive Language-Image Pre-training