Procedural Content Generation: Better Benchmarks for Transfer   Reinforcement Learning

Matthias M\"uller-Brockhausen; Mike Preuss; Aske Plaat

arXiv:2105.14780·cs.LG·January 13, 2022

Procedural Content Generation: Better Benchmarks for Transfer Reinforcement Learning

Matthias M\"uller-Brockhausen, Mike Preuss, Aske Plaat

PDF

TL;DR

This paper reviews transfer reinforcement learning, highlighting the late adoption of deep learning, ongoing challenges in generalization, and the need for unified benchmarks and procedural content generation to advance the field.

Contribution

It provides a comprehensive analysis of current TRL research, identifies key challenges, and suggests future directions including benchmarks and procedural content generation.

Findings

01

Deep learning adoption in TRL started in 2018.

02

Generalization remains a major challenge in TRL.

03

Emerging benchmarks like Alchemy and Meta-World are promising.

Abstract

The idea of transfer in reinforcement learning (TRL) is intriguing: being able to transfer knowledge from one problem to another problem without learning everything from scratch. This promises quicker learning and learning more complex methods. To gain an insight into the field and to detect emerging trends, we performed a database search. We note a surprisingly late adoption of deep learning that starts in 2018. The introduction of deep learning has not yet solved the greatest challenge of TRL: generalization. Transfer between different domains works well when domains have strong similarities (e.g. MountainCar to Cartpole), and most TRL publications focus on different tasks within the same domain that have few differences. Most TRL applications we encountered compare their improvements against self-defined baselines, and the field is still missing unified benchmarks. We consider this…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.