Underrepresented in Foundation Model Pretraining Data? A One-Shot Probe
Chris Vorster, Mayug Maniparambil, Noel E. O'Connor, Noel Murphy, Derek Molloy

TL;DR
This paper introduces a data-efficient, one-shot probing method using large language models to predict the zero-shot accuracy of vision-language foundation models across diverse and underrepresented visual domains, aiding resource allocation.
Contribution
It presents a novel, low-cost approach to estimate VLFM performance in niche domains with minimal data, especially benefiting underrepresented regions like Africa.
Findings
Achieves a Pearson-r correlation of 0.96 in accuracy prediction
Effective across five diverse datasets including underrepresented domains
Provides a practical tool for informed decision-making in model deployment
Abstract
Large-scale Vision-Language Foundation Models (VLFMs), such as CLIP, now underpin a wide range of computer vision research and applications. VLFMs are often adapted to various domain-specific tasks. However, VLFM performance on novel, specialised, or underrepresented domains remains inconsistent. Evaluating VLFMs typically requires labelled test sets, which are often unavailable for niche domains of interest, particularly those from the Global South. We address this gap by proposing a highly data-efficient method to predict a VLFM's zero-shot accuracy on a target domain using only a single labelled image per class. Our approach uses a Large Language Model to generate plausible counterfactual descriptions of a given image. By measuring the VLFM's ability to distinguish the correct description from these hard negatives, we engineer features that capture the VLFM's discriminative power in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Advanced Neural Network Applications
