Meta-World: A Benchmark and Evaluation for Multi-Task and Meta Reinforcement Learning
Tianhe Yu, Deirdre Quillen, Zhanpeng He, Ryan Julian, Avnish Narayan,, Hayden Shively, Adithya Bellathur, Karol Hausman, Chelsea Finn, Sergey Levine

TL;DR
This paper introduces Meta-World, a comprehensive benchmark with 50 robotic manipulation tasks designed to evaluate and improve meta-reinforcement learning algorithms' ability to generalize across diverse tasks.
Contribution
It provides an open-source, broad task benchmark for meta-RL, highlighting current algorithms' limitations in multi-task generalization and fostering future research.
Findings
Algorithms perform well on individual tasks but struggle with multiple tasks.
Meta-World enables evaluation of generalization to new, unseen tasks.
Current methods need improvement for effective multi-task learning.
Abstract
Meta-reinforcement learning algorithms can enable robots to acquire new skills much more quickly, by leveraging prior experience to learn how to learn. However, much of the current research on meta-reinforcement learning focuses on task distributions that are very narrow. For example, a commonly used meta-reinforcement learning benchmark uses different running velocities for a simulated robot as different tasks. When policies are meta-trained on such narrow task distributions, they cannot possibly generalize to more quickly acquire entirely new tasks. Therefore, if the aim of these methods is to enable faster acquisition of entirely new behaviors, we must evaluate them on task distributions that are sufficiently broad to enable generalization to new behaviors. In this paper, we propose an open-source simulated benchmark for meta-reinforcement learning and multi-task learning consisting…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Neural dynamics and brain function · Data Stream Mining Techniques
