ModelLens: Finding the Best for Your Task from Myriads of Models
Rui Cai, Weijie Jacky Mo, Xiaofei Wen, Qiyao Ma, Wenhui Zhu, Xiwen Chen, Muhao Chen, and Zhe Zhao

TL;DR
ModelLens is a unified framework that leverages scattered leaderboard data to recommend the best models for new datasets without expensive evaluations, improving model selection across diverse tasks.
Contribution
It introduces a performance-aware latent space learned from public leaderboard interactions, enabling effective model ranking without running models on target datasets.
Findings
Outperforms baselines relying on metadata or direct evaluation.
Improves model routing methods by up to 81%.
Generalizes well to text and vision-language tasks.
Abstract
The open-source model ecosystem now contains hundreds of thousands of pretrained models, yet picking the best model for a new dataset is increasingly infeasible: new models and unbenchmarked datasets emerge continuously, leaving practitioners with no prior records on either side. Existing approaches handle only fragments of this in-the-wild setting: AutoML and transferability estimation select models from small predefined pools or require expensive per-model forward passes on the target dataset, while model routing presupposes a given candidate pool. We introduce ModelLens, a unified framework for model recommendation in the wild. Our key insight is that public leaderboard interactions, though scattered and noisy, collectively trace out an implicit atlas of model capabilities across heterogeneous evaluation settings, a signal rich enough to learn from directly. By learning a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
