Adversarial Skill Chaining for Long-Horizon Robot Manipulation via   Terminal State Regularization

Youngwoon Lee; Joseph J. Lim; Anima Anandkumar; Yuke Zhu

arXiv:2111.07999·cs.LG·November 16, 2021·5 cites

Adversarial Skill Chaining for Long-Horizon Robot Manipulation via Terminal State Regularization

Youngwoon Lee, Joseph J. Lim, Anima Anandkumar, Yuke Zhu

PDF

Open Access

TL;DR

This paper introduces an adversarial regularization method for chaining skills in long-horizon robot manipulation tasks, enabling successful policy composition without requiring extensive initial state coverage.

Contribution

It proposes a novel terminal state regularization technique within an adversarial learning framework to improve skill chaining for complex manipulation tasks.

Findings

01

First model-free RL algorithm to solve furniture assembly tasks

02

Outperforms prior skill chaining approaches on long-horizon tasks

03

Achieves successful long-horizon manipulation without large initial state distributions

Abstract

Skill chaining is a promising approach for synthesizing complex behaviors by sequentially combining previously learned skills. Yet, a naive composition of skills fails when a policy encounters a starting state never seen during its training. For successful skill chaining, prior approaches attempt to widen the policy's starting state distribution. However, these approaches require larger state distributions to be covered as more policies are sequenced, and thus are limited to short skill sequences. In this paper, we propose to chain multiple policies without excessively large initial state distributions by regularizing the terminal state distributions in an adversarial learning framework. We evaluate our approach on two complex long-horizon manipulation tasks of furniture assembly. Our results have shown that our method establishes the first model-free reinforcement learning algorithm to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Adversarial Robustness in Machine Learning · Robot Manipulation and Learning