Hardware Conditioned Policies for Multi-Robot Transfer Learning

Tao Chen; Adithyavairavan Murali; Abhinav Gupta

arXiv:1811.09864·cs.RO·January 15, 2019·33 cites

TL;DR

This paper introduces Hardware Conditioned Policies, a universal reinforcement learning approach that enables zero-shot transfer and efficient fine-tuning of robotic policies across diverse hardware configurations.

Contribution

The paper presents a novel hardware-conditioned policy framework that generalizes across different robot morphologies and dynamics, reducing training costs for new robots.

Findings

1. Zero-shot transfer to unseen robots using hardware encoding.

2. Fine-tuning is more sample-efficient than training from scratch.

3. Hardware embeddings improve policy generalization.

Abstract

Deep reinforcement learning could be used to learn dexterous robotic policies but it is challenging to transfer them to new robots with vastly different hardware properties. It is also prohibitively expensive to learn a new policy from scratch for each robot hardware due to the high sample complexity of modern state-of-the-art algorithms. We propose a novel approach called Hardware Conditioned Policies where we train a universal policy conditioned on a vector representation of robot hardware. We considered robots in simulation with varied dynamics, kinematic structure, kinematic lengths and degrees-of-freedom. First, we use the kinematic structure directly as the hardware encoding and show great zero-shot transfer to completely novel robots not seen during training. For robots with lower zero-shot success rate, we also demonstrate that fine-tuning the policy network is…
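The core idea in the abstract, a single policy that takes a hardware vector as an extra input, can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper or its repository: the names `encode_kinematics` and `LinearPolicy` are hypothetical, the encoding is simply a zero-padded list of link lengths, the state dimension is held fixed across robots, and the policy is an untrained linear map rather than a deep network.

```python
import random
from typing import List


def encode_kinematics(link_lengths: List[float], max_links: int) -> List[float]:
    """Hypothetical hardware encoding: link lengths zero-padded to a fixed size."""
    if len(link_lengths) > max_links:
        raise ValueError("robot has more links than the encoding supports")
    return list(link_lengths) + [0.0] * (max_links - len(link_lengths))


class LinearPolicy:
    """Toy stand-in for a universal policy: action = W @ [state; hardware]."""

    def __init__(self, state_dim: int, hw_dim: int, action_dim: int, seed: int = 0):
        rng = random.Random(seed)
        in_dim = state_dim + hw_dim
        # Small random weights; a real policy would be trained with RL.
        self.weights = [
            [rng.uniform(-0.1, 0.1) for _ in range(in_dim)]
            for _ in range(action_dim)
        ]

    def act(self, state: List[float], hardware: List[float]) -> List[float]:
        # Conditioning on hardware = concatenating its vector to the state.
        x = list(state) + list(hardware)
        if len(x) != len(self.weights[0]):
            raise ValueError("unexpected input dimension")
        return [sum(w * xi for w, xi in zip(row, x)) for row in self.weights]


# One shared policy, two robots with different numbers of links.
policy = LinearPolicy(state_dim=3, hw_dim=4, action_dim=2)
state = [0.5, -0.2, 0.1]
hw_two_link = encode_kinematics([0.3, 0.2], max_links=4)
hw_four_link = encode_kinematics([0.3, 0.2, 0.1, 0.1], max_links=4)

action_two_link = policy.act(state, hw_two_link)
action_four_link = policy.act(state, hw_four_link)
```

Because the hardware vector is part of the input, the same weights produce different actions for the two robots in the same state, which is what makes a single policy usable across hardware in this toy setting.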

Code & Models

Repositories

taochenshh/hcp (PyTorch, official)

Taxonomy

Topics: Adversarial Robustness in Machine Learning · Reinforcement Learning in Robotics · Robot Manipulation and Learning