Keyframe Demonstration Seeded and Bayesian Optimized Policy Search

Onur Berk Tore; Farzin Negahbani; Baris Akgun

arXiv:2301.08184·cs.RO·January 20, 2023·1 cites

Keyframe Demonstration Seeded and Bayesian Optimized Policy Search

Onur Berk Tore, Farzin Negahbani, Baris Akgun

PDF

Open Access

TL;DR

This paper presents a new Learning from Demonstration framework that combines keyframe demonstrations, Dynamic Bayesian Networks, and Bayesian Optimized Policy Search to enhance robotic skill learning and exploration efficiency.

Contribution

It introduces BO-PI2, a Bayesian optimized policy search method that leverages perceptual relations and reward prediction to improve learning from demonstrations.

Findings

01

BO-PI2 outperforms state-of-the-art methods in real robot tasks.

02

The approach effectively focuses exploration on failed sub-goals.

03

Increased reward and success rates demonstrate improved learning efficiency.

Abstract

This paper introduces a novel Learning from Demonstration framework to learn robotic skills with keyframe demonstrations using a Dynamic Bayesian Network (DBN) and a Bayesian Optimized Policy Search approach to improve the learned skills. DBN learns the robot motion, perceptual change in the object of interest (aka skill sub-goals) and the relation between them. The rewards are also learned from the perceptual part of the DBN. The policy search part is a semiblack box algorithm, which we call BO-PI2 . It utilizes the action-perception relation to focus the high-level exploration, uses Gaussian Processes to model the expected-return and performs Upper Confidence Bound type low-level exploration for sampling the rollouts. BO-PI2 is compared against a stateof-the-art method on three different skills in a real robot setting with expert and naive user demonstrations. The results show that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · Gaussian Processes and Bayesian Inference