Tight Lower Bounds on Worst-Case Guarantees for Zero-Shot Learning with   Attributes

Alessio Mazzetto; Cristina Menghini; Andrew Yuan; Eli Upfal; Stephen; H. Bach

arXiv:2205.13068·cs.LG·November 29, 2022·1 cites

Tight Lower Bounds on Worst-Case Guarantees for Zero-Shot Learning with Attributes

Alessio Mazzetto, Cristina Menghini, Andrew Yuan, Eli Upfal, Stephen, H. Bach

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper establishes a tight, computable lower bound on the worst-case error for zero-shot learning with attributes, revealing the intrinsic difficulty of the problem based on the class-attribute matrix.

Contribution

It provides the first non-trivial, tight lower bound on zero-shot learning error that is based solely on the class-attribute matrix and is practically computable.

Findings

01

Lower bound characterizes intrinsic difficulty of zero-shot learning.

02

Bound is tight and achievable with a randomized map.

03

Analysis predicts class confusion in practical zero-shot methods.

Abstract

We develop a rigorous mathematical analysis of zero-shot learning with attributes. In this setting, the goal is to label novel classes with no training data, only detectors for attributes and a description of how those attributes are correlated with the target classes, called the class-attribute matrix. We develop the first non-trivial lower bound on the worst-case error of the best map from attributes to classes for this setting, even with perfect attribute detectors. The lower bound characterizes the theoretical intrinsic difficulty of the zero-shot problem based on the available information -- the class-attribute matrix -- and the bound is practically computable from it. Our lower bound is tight, as we show that we can always find a randomized map from attributes to classes whose expected error is upper bounded by the value of the lower bound. We show that our analysis can be…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

BatsResearch/mazzetto-neurips22-code
pytorchOfficial

Videos

Tight Lower Bounds on Worst-Case Guarantees for Zero-Shot Learning with Attributes· slideslive

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning