Efficient Robotic Manipulation Through Offline-to-Online Reinforcement   Learning and Goal-Aware State Information

Jin Li; Xianyuan Zhan; Zixu Xiao; Guyue Zhou

arXiv:2110.10905·cs.RO·October 22, 2021·1 cites

Efficient Robotic Manipulation Through Offline-to-Online Reinforcement Learning and Goal-Aware State Information

Jin Li, Xianyuan Zhan, Zixu Xiao, Guyue Zhou

PDF

Open Access

TL;DR

This paper presents a unified offline-to-online reinforcement learning framework with goal-aware state information and advanced representation learning, significantly improving data efficiency and performance in robotic manipulation tasks.

Contribution

It introduces a novel unified framework that mitigates transition performance drops and incorporates goal-aware states to enhance learning efficiency in robotic manipulation.

Findings

01

Outperforms state-of-the-art methods in multiple tasks

02

Reduces training time and data requirements

03

Improves policy robustness and success rates

Abstract

End-to-end learning robotic manipulation with high data efficiency is one of the key challenges in robotics. The latest methods that utilize human demonstration data and unsupervised representation learning has proven to be a promising direction to improve RL learning efficiency. The use of demonstration data also allows "warming-up" the RL policies using offline data with imitation learning or the recently emerged offline reinforcement learning algorithms. However, existing works often treat offline policy learning and online exploration as two separate processes, which are often accompanied by severe performance drop during the offline-to-online transition. Furthermore, many robotic manipulation tasks involve complex sub-task structures, which are very challenging to be solved in RL with sparse reward. In this work, we propose a unified offline-to-online RL framework that resolves the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · Domain Adaptation and Few-Shot Learning