Continual World: A Robotic Benchmark For Continual Reinforcement Learning
Maciej Wo{\l}czyk, Micha{\l} Zaj\k{a}c, Razvan Pascanu, {\L}ukasz, Kuci\'nski, Piotr Mi{\l}o\'s

TL;DR
Continual World introduces a robotic benchmark for continual reinforcement learning emphasizing forward transfer, aiming to address limitations of existing methods and facilitate understanding of continual learning in robotics.
Contribution
The paper presents a new benchmark, Continual World, focused on realistic robotic tasks to evaluate continual RL methods beyond catastrophic forgetting.
Findings
Existing CL methods show limitations on the benchmark.
The benchmark highlights unique challenges in RL continual learning.
It provides a computationally inexpensive platform for future research.
Abstract
Continual learning (CL) -- the ability to continuously learn, building on previously acquired knowledge -- is a natural requirement for long-lived autonomous reinforcement learning (RL) agents. While building such agents, one needs to balance opposing desiderata, such as constraints on capacity and compute, the ability to not catastrophically forget, and to exhibit positive transfer on new tasks. Understanding the right trade-off is conceptually and computationally challenging, which we argue has led the community to overly focus on catastrophic forgetting. In response to these issues, we advocate for the need to prioritize forward transfer and propose Continual World, a benchmark consisting of realistic and meaningfully diverse robotic tasks built on top of Meta-World as a testbed. Following an in-depth empirical evaluation of existing CL methods, we pinpoint their limitations and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Reinforcement Learning in Robotics · Multimodal Machine Learning Applications
