Bayesian Meta-Learning for Few-Shot Policy Adaptation Across Robotic   Platforms

Ali Ghadirzadeh; Xi Chen; Petra Poklukar; Chelsea Finn; M{\aa}rten; Bj\"orkman; Danica Kragic

arXiv:2103.03697·cs.RO·March 8, 2021

Bayesian Meta-Learning for Few-Shot Policy Adaptation Across Robotic Platforms

Ali Ghadirzadeh, Xi Chen, Petra Poklukar, Chelsea Finn, M{\aa}rten, Bj\"orkman, Danica Kragic

PDF

Open Access

TL;DR

This paper introduces a probabilistic meta-learning framework for efficiently adapting reinforcement learning policies to new robotic platforms with minimal data, addressing hardware variability challenges.

Contribution

It proposes a novel probabilistic gradient-based meta-learning approach that models uncertainty with a low-dimensional latent variable for few-shot policy adaptation across robots.

Findings

01

Successfully adapts policies to new robots with few demonstrations

02

Outperforms state-of-the-art meta-learning methods in experiments

03

Effective on both simulated and real-robot tasks

Abstract

Reinforcement learning methods can achieve significant performance but require a large amount of training data collected on the same robotic platform. A policy trained with expensive data is rendered useless after making even a minor change to the robot hardware. In this paper, we address the challenging problem of adapting a policy, trained to perform a task, to a novel robotic hardware platform given only few demonstrations of robot motion trajectories on the target robot. We formulate it as a few-shot meta-learning problem where the goal is to find a meta-model that captures the common structure shared across different robotic platforms such that data-efficient adaptation can be performed. We achieve such adaptation by introducing a learning framework consisting of a probabilistic gradient-based meta-learning algorithm that models the uncertainty arising from the few-shot setting…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Domain Adaptation and Few-Shot Learning · Reinforcement Learning in Robotics