Simulation-Aided Policy Tuning for Black-Box Robot Learning

Shiming He; Alexander von Rohr; Dominik Baumann; Ji Xiang; Sebastian Trimpe

arXiv:2411.14246·cs.RO·February 11, 2025

TL;DR

This paper introduces a simulation-aided black-box policy search algorithm that leverages both real robot data and simulation to enable fast, data-efficient learning and adaptation in robotic tasks.

Contribution

It presents a novel dual-information-source optimization algorithm that improves policy learning efficiency by combining real-world experiments with simulation data.

Findings

1. Reduces robot interaction time significantly
2. Achieves high-probability policy improvements
3. Demonstrates successful real robot task learning

Abstract

How can robots learn and adapt to new tasks and situations with little data? Systematic exploration and simulation are crucial tools for efficient robot learning. We present a novel black-box policy search algorithm focused on data-efficient policy improvements. The algorithm learns directly on the robot and treats simulation as an additional information source to speed up the learning process. At the core of the algorithm, a probabilistic model learns the dependence of the policy parameters and the robot learning objective not only by performing experiments on the robot, but also by leveraging data from a simulator. This substantially reduces interaction time with the robot. Using this model, we can guarantee improvements with high probability for each policy update, thereby facilitating fast, goal-oriented learning. We evaluate our algorithm on simulated fine-tuning tasks and…
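The idea of a probabilistic model that pools robot and simulator data, and of accepting a policy update only when improvement is likely, can be illustrated with a small sketch. Everything below is an assumption made for illustration and is not taken from the paper or its repository: the additive two-source Gaussian process kernel (a shared term plus a real-robot-only "gap" term), the fixed hyperparameters, the toy quadratic cost, and the function names `two_source_kernel` and `improvement_probability`. The objective is treated as a cost to be minimized.

```python
import math
import numpy as np


def rbf(a, b, lengthscale, variance):
    """Squared-exponential kernel between row-wise inputs a (n, d) and b (m, d)."""
    d2 = ((a[:, None, :] - b[None, :, :]) ** 2).sum(axis=-1)
    return variance * np.exp(-0.5 * d2 / lengthscale**2)


def two_source_kernel(xa, sa, xb, sb):
    """Assumed kernel: shared term for both sources, extra term for real data.

    sa and sb are source indicators (1.0 = real robot, 0.0 = simulation).
    """
    shared = rbf(xa, xb, lengthscale=1.0, variance=1.0)
    # The sim-to-real gap only correlates pairs of real-robot evaluations.
    gap = rbf(xa, xb, lengthscale=1.0, variance=0.1) * np.outer(sa, sb)
    return shared + gap


def improvement_probability(X, S, y, x_cur, x_new, noise=1e-3):
    """P(real cost at x_new < real cost at x_cur) under the GP posterior."""
    K = two_source_kernel(X, S, X, S) + noise * np.eye(len(X))
    Q = np.vstack([x_cur, x_new])      # query both policies jointly
    real = np.ones(2)                  # both queries concern the real robot
    Ks = two_source_kernel(Q, real, X, S)
    Kss = two_source_kernel(Q, real, Q, real)
    mean = Ks @ np.linalg.solve(K, y)
    cov = Kss - Ks @ np.linalg.solve(K, Ks.T)
    w = np.array([1.0, -1.0])          # cost(x_cur) - cost(x_new)
    mu = float(w @ mean)
    var = max(float(w @ cov @ w), 1e-12)
    # Gaussian probability that the cost difference is positive.
    return 0.5 * (1.0 + math.erf(mu / math.sqrt(2.0 * var)))


# Toy data (hypothetical): many cheap simulator runs, two robot experiments.
x_sim = np.linspace(-2.0, 2.0, 9)[:, None]
x_real = np.array([[1.0], [0.0]])
X = np.vstack([x_sim, x_real])
S = np.concatenate([np.zeros(len(x_sim)), np.ones(len(x_real))])
y = np.concatenate([x_sim[:, 0] ** 2, x_real[:, 0] ** 2 + 0.1 * x_real[:, 0]])

p_accept = improvement_probability(X, S, y, np.array([1.0]), np.array([0.5]))
p_reject = improvement_probability(X, S, y, np.array([0.5]), np.array([1.0]))
```

In a loop of this kind, a candidate parameter vector would be tried on the robot only if its improvement probability exceeds a chosen threshold (for example 0.95, a hypothetical value); the simulator points reduce the number of robot experiments needed before that threshold is reached.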


Code & Models

Repositories

data-science-in-mechanical-engineering/hci-gibo (PyTorch, official)
