All models are wrong, some are useful: Model Selection with Limited Labels

Patrik Okanovic; Andreas Kirsch; Jannes Kasper; Torsten Hoefler; Andreas Krause; Nezihe Merve Gürel

arXiv:2410.13609 · cs.LG · October 28, 2024

TL;DR

This paper presents MODEL SELECTOR, a label-efficient framework for selecting the best pretrained model for a target dataset by sampling highly informative examples, significantly reducing labeling costs.

Contribution

Introduces MODEL SELECTOR, a novel method for efficient model selection with limited labels, demonstrating substantial reductions in labeling costs across diverse datasets and models.

Findings

01 Reduces labeling cost by up to 94.15% when identifying the best model, relative to the strongest baseline.

02 Reduces labeling cost by up to 72.41% when selecting a near-best model.

03 Consistently outperforms baseline methods across 18 model collections on 16 datasets.

Abstract

We introduce MODEL SELECTOR, a framework for label-efficient selection of pretrained classifiers. Given a pool of unlabeled target data, MODEL SELECTOR samples a small subset of highly informative examples for labeling, in order to efficiently identify the best pretrained model for deployment on this target dataset. Through extensive experiments, we demonstrate that MODEL SELECTOR drastically reduces the need for labeled data while consistently picking the best or near-best performing model. Across 18 model collections on 16 different datasets, comprising over 1,500 pretrained models, MODEL SELECTOR reduces the labeling cost by up to 94.15% to identify the best model compared to the cost of the strongest baseline. Our results further highlight the robustness of MODEL SELECTOR in model selection, as it reduces the labeling cost by up to 72.41% when selecting a near-best model, whose…
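
To make the idea of label-efficient model selection concrete, here is a minimal sketch in Python. It is not the MODEL SELECTOR algorithm from the paper. It swaps in a simple disagreement heuristic: request labels only for the examples on which the candidate models disagree most, then pick the model with the most correct predictions on those labels. The function name `select_model_by_disagreement`, the `oracle` callable, and the array layout are all assumptions made for illustration.

```python
import numpy as np


def select_model_by_disagreement(preds, oracle, budget):
    """Pick a model using at most `budget` labels (illustrative heuristic only).

    preds  : int array, shape (n_models, n_examples), non-negative class ids
    oracle : callable mapping an example index to its true class id
             (hypothetical stand-in for a human annotator)
    budget : maximum number of labels to request
    """
    preds = np.asarray(preds)
    n_models, _ = preds.shape
    n_classes = int(preds.max()) + 1

    # votes[c, i] = number of models predicting class c on example i
    votes = np.stack([(preds == c).sum(axis=0) for c in range(n_classes)])

    # Disagreement = share of models that deviate from the majority vote.
    disagreement = 1.0 - votes.max(axis=0) / n_models

    # Query the most contested examples first; a stable sort keeps
    # the original index order among ties.
    order = np.argsort(-disagreement, kind="stable")[:budget]

    # Count how often each model matches the requested labels.
    correct = np.zeros(n_models, dtype=int)
    for idx in order:
        correct += preds[:, idx] == oracle(int(idx))

    return int(np.argmax(correct)), [int(i) for i in order]
```

Examples on which all models agree cannot separate the candidates, so this sketch spends the labeling budget only where predictions differ. The paper's own selection criterion may differ substantially from this heuristic.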

Code & Models

Repositories

robustml-lab/model-selector (Official)

Taxonomy

Topics: Machine Learning and Data Classification