Navigating High Dimensional Concept Space with Metalearning
Max Gupta

TL;DR
This paper explores how gradient-based meta-learning can help neural networks efficiently learn abstract, compositional concepts from few examples, especially in high-dimensional spaces, by analyzing different complexity regimes and optimization strategies.
Contribution
It systematically compares meta-learning and supervised learning on Boolean concepts, revealing when meta-learning improves few-shot learning and analyzing the role of second-order optimization methods.
Findings
Meta-learners outperform supervised learning on compositional complexity.
Featural complexity increases loss landscape roughness, affecting optimization.
Increasing adaptation steps improves out-of-distribution generalization on complex concepts.
Abstract
Rapidly learning abstract concepts from limited examples is a hallmark of human intelligence. This work investigates whether gradient-based meta-learning can equip neural networks with inductive biases for efficient few-shot acquisition of discrete concepts. I compare meta-learning methods against a supervised learning baseline on Boolean concepts (logical statements) generated by a probabilistic context-free grammar (PCFG). By systematically varying concept dimensionality (number of features) and recursive compositionality (depth of grammar recursion), I delineate between complexity regimes in which meta-learning robustly improves few-shot concept learning and regimes in which it does not. Meta-learners are much better able to handle compositional complexity than featural complexity. I highlight some reasons for this with a representational analysis of the weights of meta-learners and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Data Classification · Advanced Graph Neural Networks
