Mix&Match - Agent Curricula for Reinforcement Learning

Wojciech Marian Czarnecki; Siddhant M. Jayakumar; Max Jaderberg,; Leonard Hasenclever; Yee Whye Teh; Simon Osindero; Nicolas Heess; Razvan; Pascanu

arXiv:1806.01780·cs.LG·June 6, 2018·1 cites

Mix&Match - Agent Curricula for Reinforcement Learning

Wojciech Marian Czarnecki, Siddhant M. Jayakumar, Max Jaderberg,, Leonard Hasenclever, Yee Whye Teh, Simon Osindero, Nicolas Heess, Razvan, Pascanu

PDF

Open Access

TL;DR

Mix&Match (M&M) is a novel RL training framework that automatically creates curricula by progressively altering internal policy representations, enabling faster learning and better performance across diverse tasks and architectures.

Contribution

We introduce Mix&Match, a new curriculum learning method that modifies internal policy representations to facilitate efficient training of complex RL agents.

Findings

01

Achieved faster training and higher performance in a 3D first-person task.

02

Successfully progressed through internal architecture variants.

03

Improved multitask learning performance with our method.

Abstract

We introduce Mix&Match (M&M) - a training framework designed to facilitate rapid and effective learning in RL agents, especially those that would be too slow or too challenging to train otherwise. The key innovation is a procedure that allows us to automatically form a curriculum over agents. Through such a curriculum we can progressively train more complex agents by, effectively, bootstrapping from solutions found by simpler agents. In contradistinction to typical curriculum learning approaches, we do not gradually modify the tasks or environments presented, but instead use a process to gradually alter how the policy is represented internally. We show the broad applicability of our method by demonstrating significant performance gains in three different experimental setups: (1) We train an agent able to control more than 700 actions in a challenging 3D first-person task; using our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Multi-Agent Systems and Negotiation · Software Engineering Research