Bayesian Meta-Learning for Few-Shot Policy Adaptation Across Robotic Platforms
Ali Ghadirzadeh, Xi Chen, Petra Poklukar, Chelsea Finn, M{\aa}rten, Bj\"orkman, Danica Kragic

TL;DR
This paper introduces a probabilistic meta-learning framework for efficiently adapting reinforcement learning policies to new robotic platforms with minimal data, addressing hardware variability challenges.
Contribution
It proposes a novel probabilistic gradient-based meta-learning approach that models uncertainty with a low-dimensional latent variable for few-shot policy adaptation across robots.
Findings
Successfully adapts policies to new robots with few demonstrations
Outperforms state-of-the-art meta-learning methods in experiments
Effective on both simulated and real-robot tasks
Abstract
Reinforcement learning methods can achieve significant performance but require a large amount of training data collected on the same robotic platform. A policy trained with expensive data is rendered useless after making even a minor change to the robot hardware. In this paper, we address the challenging problem of adapting a policy, trained to perform a task, to a novel robotic hardware platform given only few demonstrations of robot motion trajectories on the target robot. We formulate it as a few-shot meta-learning problem where the goal is to find a meta-model that captures the common structure shared across different robotic platforms such that data-efficient adaptation can be performed. We achieve such adaptation by introducing a learning framework consisting of a probabilistic gradient-based meta-learning algorithm that models the uncertainty arising from the few-shot setting…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Domain Adaptation and Few-Shot Learning · Reinforcement Learning in Robotics
