Model-agnostic Measure of Generalization Difficulty
Akhilan Boopathy, Kevin Liu, Jaedong Hwang, Shu Ge, Asaad, Mohammedsaleh, Ila Fiete

TL;DR
This paper introduces a novel, model-agnostic measure of the inherent difficulty of generalization tasks in machine learning, quantifying the complexity of tasks across different learning paradigms.
Contribution
It proposes the first measure of generalization difficulty that is independent of specific models, based on inductive bias complexity and intrinsic dimensionality.
Findings
The measure scales exponentially with intrinsic dimensionality.
It can compare difficulty across supervised, reinforcement, and meta-learning.
Empirical results match expected difficulty trends among datasets.
Abstract
The measure of a machine learning algorithm is the difficulty of the tasks it can perform, and sufficiently difficult tasks are critical drivers of strong machine learning models. However, quantifying the generalization difficulty of machine learning benchmarks has remained challenging. We propose what is to our knowledge the first model-agnostic measure of the inherent generalization difficulty of tasks. Our inductive bias complexity measure quantifies the total information required to generalize well on a task minus the information provided by the data. It does so by measuring the fractional volume occupied by hypotheses that generalize on a task given that they fit the training data. It scales exponentially with the intrinsic dimensionality of the space over which the model must generalize but only polynomially in resolution per dimension, showing that tasks which require…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsMachine Learning and Data Classification · Domain Adaptation and Few-Shot Learning · Explainable Artificial Intelligence (XAI)
