When does compositional structure yield compositional generalization? A kernel theory
Samuel Lippl, Kim Stachenfeld

TL;DR
This paper develops a kernel theory to understand when compositional structure leads to generalization, revealing fundamental limitations and failure modes in models like neural networks, with empirical validation on deep learning architectures.
Contribution
It introduces a theoretical framework for compositional generalization in kernel models, identifying key limitations and failure modes, and validates these findings empirically on neural networks.
Findings
Kernel models are limited to conjunction-wise additive functions.
They cannot transitively generalize equivalence relations.
Training data biases cause memorization leak and shortcut bias.
Abstract
Compositional generalization (the ability to respond correctly to novel combinations of familiar components) is thought to be a cornerstone of intelligent behavior. Compositionally structured (e.g. disentangled) representations support this ability; however, the conditions under which they are sufficient for the emergence of compositional generalization remain unclear. To address this gap, we present a theory of compositional generalization in kernel models with fixed, compositionally structured representations. This provides a tractable framework for characterizing the impact of training data statistics on generalization. We find that these models are limited to functions that assign values to each combination of components seen during training, and then sum up these values ("conjunction-wise additivity"). This imposes fundamental restrictions on the set of tasks compositionally…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGeochemistry and Geologic Mapping
MethodsSparse Evolutionary Training
