Enhancing Robotic Manipulation: Harnessing the Power of Multi-Task   Reinforcement Learning and Single Life Reinforcement Learning in Meta-World

Ghadi Nehme; Ishan Sabane; Tejas Y. Deo

arXiv:2311.12854·cs.AI·November 23, 2023·2 cites

Enhancing Robotic Manipulation: Harnessing the Power of Multi-Task Reinforcement Learning and Single Life Reinforcement Learning in Meta-World

Ghadi Nehme, Ishan Sabane, Tejas Y. Deo

PDF

Open Access

TL;DR

This paper explores combining multi-task reinforcement learning with single-life reinforcement learning to improve robotic manipulation across diverse tasks in the Meta-World environment, demonstrating enhanced generalization and performance.

Contribution

It introduces the MT-QWALE algorithm that leverages multi-task SAC as prior data for single-life RL, improving task generalization in robotic manipulation.

Findings

01

MT-QWALE outperforms standard MT-SAC in task completion.

02

The approach generalizes better to unseen target positions.

03

Ablation shows robustness even when goal information is hidden.

Abstract

At present, robots typically require extensive training to successfully accomplish a single task. However, to truly enhance their usefulness in real-world scenarios, robots should possess the capability to perform multiple tasks effectively. To address this need, various multi-task reinforcement learning (RL) algorithms have been developed, including multi-task proximal policy optimization (PPO), multi-task trust region policy optimization (TRPO), and multi-task soft-actor critic (SAC). Nevertheless, these algorithms demonstrate optimal performance only when operating within an environment or observation space that exhibits a similar distribution. In reality, such conditions are often not the norm, as robots may encounter scenarios or observations that differ from those on which they were trained. Addressing this challenge, algorithms like Q-Weighted Adversarial Learning (QWALE) attempt…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Adversarial Robustness in Machine Learning · EEG and Brain-Computer Interfaces

MethodsBalanced Selection