TL;DR
This paper introduces the LCA-on-the-Line framework to predict out-of-distribution performance of models using class hierarchies, revealing strong correlations and improving understanding of model generalization across diverse datasets.
Contribution
It proposes a novel hierarchical distance measure, LCA-on-the-Line, and demonstrates its effectiveness in predicting OOD accuracy and enhancing model generalization through taxonomy alignment.
Findings
Strong linear correlation between ID LCA distance and OOD accuracy.
LCA distance remains robust across different taxonomic hierarchies.
Aligning predictions with class taxonomies improves model generalization.
Abstract
We tackle the challenge of predicting models' Out-of-Distribution (OOD) performance using in-distribution (ID) measurements without requiring OOD data. Existing evaluations with "Effective Robustness", which use ID accuracy as an indicator of OOD accuracy, encounter limitations when models are trained with diverse supervision and distributions, such as class labels (Vision Models, VMs, on ImageNet) and textual descriptions (Visual-Language Models, VLMs, on LAION). VLMs often generalize better to OOD data than VMs despite having similar or lower ID performance. To improve the prediction of models' OOD performance from ID measurements, we introduce the Lowest Common Ancestor (LCA)-on-the-Line framework. This approach revisits the established concept of LCA distance, which measures the hierarchical distance between labels and predictions within a predefined class hierarchy, such as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
