Tight Lower Bounds on Worst-Case Guarantees for Zero-Shot Learning with Attributes
Alessio Mazzetto, Cristina Menghini, Andrew Yuan, Eli Upfal, Stephen, H. Bach

TL;DR
This paper establishes a tight, computable lower bound on the worst-case error for zero-shot learning with attributes, revealing the intrinsic difficulty of the problem based on the class-attribute matrix.
Contribution
It provides the first non-trivial, tight lower bound on zero-shot learning error that is based solely on the class-attribute matrix and is practically computable.
Findings
Lower bound characterizes intrinsic difficulty of zero-shot learning.
Bound is tight and achievable with a randomized map.
Analysis predicts class confusion in practical zero-shot methods.
Abstract
We develop a rigorous mathematical analysis of zero-shot learning with attributes. In this setting, the goal is to label novel classes with no training data, only detectors for attributes and a description of how those attributes are correlated with the target classes, called the class-attribute matrix. We develop the first non-trivial lower bound on the worst-case error of the best map from attributes to classes for this setting, even with perfect attribute detectors. The lower bound characterizes the theoretical intrinsic difficulty of the zero-shot problem based on the available information -- the class-attribute matrix -- and the bound is practically computable from it. Our lower bound is tight, as we show that we can always find a randomized map from attributes to classes whose expected error is upper bounded by the value of the lower bound. We show that our analysis can be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning
