Meta-Learning of Structured Task Distributions in Humans and Machines
Sreejan Kumar, Ishita Dasgupta, Jonathan D. Cohen, Nathaniel D. Daw,, Thomas L. Griffiths

TL;DR
This paper compares human and machine meta-learning on structured tasks generated by a grammar, revealing humans excel with structure while agents perform better on unstructured, statistically similar tasks, highlighting differences in learning strategies.
Contribution
It introduces a new structured task distribution and a null control distribution, and compares human and machine meta-learning, revealing fundamental differences in their use of task structure.
Findings
Humans outperform machines on structured tasks.
Machines perform better on unstructured, null tasks.
Meta-learners may not utilize task structure as humans do.
Abstract
In recent years, meta-learning, in which a model is trained on a family of tasks (i.e. a task distribution), has emerged as an approach to training neural networks to perform tasks that were previously assumed to require structured representations, making strides toward closing the gap between humans and machines. However, we argue that evaluating meta-learning remains a challenge, and can miss whether meta-learning actually uses the structure embedded within the tasks. These meta-learners might therefore still be significantly different from humans learners. To demonstrate this difference, we first define a new meta-reinforcement learning task in which a structured task distribution is generated using a compositional grammar. We then introduce a novel approach to constructing a "null task distribution" with the same statistical complexity as this structured task distribution but…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsTopic Modeling · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications
