Why Do Animals Need Shaping? A Theory of Task Composition and Curriculum Learning
Jin Hwa Lee, Stefano Sarao Mannelli, Andrew Saxe

TL;DR
This paper develops a theoretical framework for understanding how curriculum shaping enhances learning of complex, compositional tasks in deep reinforcement learning, revealing the dynamics and benefits of different training strategies.
Contribution
It introduces an analytical model of deep policy gradient learning for compositional tasks, providing insights into shaping strategies and their impact on learning efficiency.
Findings
Pre-training task primitives improves learning speed.
Task complexity influences shaping strategy effectiveness.
Theoretical analysis explains benefits of curriculum shaping in reinforcement learning.
Abstract
Diverse studies in systems neuroscience begin with extended periods of curriculum training known as `shaping' procedures. These involve progressively studying component parts of more complex tasks, and can make the difference between learning a task quickly, slowly or not at all. Despite the importance of shaping to the acquisition of complex tasks, there is as yet no theory that can help guide the design of shaping procedures, or more fundamentally, provide insight into its key role in learning. Modern deep reinforcement learning systems might implicitly learn compositional primitives within their multilayer policy networks. Inspired by these models, we propose and analyse a model of deep policy gradient learning of simple compositional reinforcement learning tasks. Using the tools of statistical physics, we solve for exact learning dynamics and characterise different learning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEducation and Critical Thinking Development
