Enhancing Robotic Manipulation: Harnessing the Power of Multi-Task Reinforcement Learning and Single Life Reinforcement Learning in Meta-World
Ghadi Nehme, Ishan Sabane, Tejas Y. Deo

TL;DR
This paper explores combining multi-task reinforcement learning with single-life reinforcement learning to improve robotic manipulation across diverse tasks in the Meta-World environment, demonstrating enhanced generalization and performance.
Contribution
It introduces the MT-QWALE algorithm that leverages multi-task SAC as prior data for single-life RL, improving task generalization in robotic manipulation.
Findings
MT-QWALE outperforms standard MT-SAC in task completion.
The approach generalizes better to unseen target positions.
Ablation shows robustness even when goal information is hidden.
Abstract
At present, robots typically require extensive training to successfully accomplish a single task. However, to truly enhance their usefulness in real-world scenarios, robots should possess the capability to perform multiple tasks effectively. To address this need, various multi-task reinforcement learning (RL) algorithms have been developed, including multi-task proximal policy optimization (PPO), multi-task trust region policy optimization (TRPO), and multi-task soft-actor critic (SAC). Nevertheless, these algorithms demonstrate optimal performance only when operating within an environment or observation space that exhibits a similar distribution. In reality, such conditions are often not the norm, as robots may encounter scenarios or observations that differ from those on which they were trained. Addressing this challenge, algorithms like Q-Weighted Adversarial Learning (QWALE) attempt…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Adversarial Robustness in Machine Learning · EEG and Brain-Computer Interfaces
MethodsBalanced Selection
