Automatically Composing Representation Transformations as a Means for Generalization
Michael B. Chang, Abhishek Gupta, Sergey Levine, Thomas L. Griffiths

TL;DR
This paper presents a formalism and a recursive learning framework that enable models to compose representation transformations, improving their ability to generalize to more complex tasks by leveraging shared subproblems.
Contribution
It introduces the compositional problem graph and the compositional recursive learner, advancing the ability of models to generalize through task composition and analogy-based reasoning.
Findings
The approach generalizes to more complex problems than previous models.
Non-compositional baselines fail to generalize as effectively.
The framework applies to both symbolic and high-dimensional domains.
Abstract
A generally intelligent learner should generalize to more complex tasks than it has previously encountered, but the two common paradigms in machine learning -- either training a separate learner per task or training a single learner for all tasks -- both have difficulty with such generalization because they do not leverage the compositional structure of the task distribution. This paper introduces the compositional problem graph as a broadly applicable formalism to relate tasks of different complexity in terms of problems with shared subproblems. We propose the compositional generalization problem for measuring how readily old knowledge can be reused and hence built upon. As a first step for tackling compositional generalization, we introduce the compositional recursive learner, a domain-general framework for learning algorithmic procedures for composing representation transformations,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Multimodal Machine Learning Applications · AI-based Problem Solving and Planning
