LCA-on-the-Line: Benchmarking Out-of-Distribution Generalization with   Class Taxonomies

Jia Shi; Gautam Gare; Jinjin Tian; Siqi Chai; Zhiqiu Lin; Arun; Vasudevan; Di Feng; Francesco Ferroni; Shu Kong

arXiv:2407.16067·cs.LG·July 24, 2024

LCA-on-the-Line: Benchmarking Out-of-Distribution Generalization with Class Taxonomies

Jia Shi, Gautam Gare, Jinjin Tian, Siqi Chai, Zhiqiu Lin, Arun, Vasudevan, Di Feng, Francesco Ferroni, Shu Kong

PDF

1 Repo

TL;DR

This paper introduces the LCA-on-the-Line framework to predict out-of-distribution performance of models using class hierarchies, revealing strong correlations and improving understanding of model generalization across diverse datasets.

Contribution

It proposes a novel hierarchical distance measure, LCA-on-the-Line, and demonstrates its effectiveness in predicting OOD accuracy and enhancing model generalization through taxonomy alignment.

Findings

01

Strong linear correlation between ID LCA distance and OOD accuracy.

02

LCA distance remains robust across different taxonomic hierarchies.

03

Aligning predictions with class taxonomies improves model generalization.

Abstract

We tackle the challenge of predicting models' Out-of-Distribution (OOD) performance using in-distribution (ID) measurements without requiring OOD data. Existing evaluations with "Effective Robustness", which use ID accuracy as an indicator of OOD accuracy, encounter limitations when models are trained with diverse supervision and distributions, such as class labels (Vision Models, VMs, on ImageNet) and textual descriptions (Visual-Language Models, VLMs, on LAION). VLMs often generalize better to OOD data than VMs despite having similar or lower ID performance. To improve the prediction of models' OOD performance from ID measurements, we introduce the Lowest Common Ancestor (LCA)-on-the-Line framework. This approach revisits the established concept of LCA distance, which measures the hierarchical distance between labels and predictions within a predefined class hierarchy, such as…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

elvishelvis/lca-on-the-line
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.