Towards Improving Learning from Demonstration Algorithms via MCMC   Methods

Carl Qi; Edward Sun; Harry Zhang

arXiv:2405.02243·cs.RO·May 27, 2024

Towards Improving Learning from Demonstration Algorithms via MCMC Methods

Carl Qi, Edward Sun, Harry Zhang

PDF

Open Access

TL;DR

This paper proposes enhancing learning from demonstration algorithms by using implicit energy-based policy models, which outperform traditional neural network models in complex robot policy learning scenarios, especially with discontinuous and multimodal functions.

Contribution

It introduces the use of implicit energy-based models for learning from demonstration, showing improved performance over explicit neural network models in complex tasks.

Findings

01

Implicit models outperform explicit neural networks in complex scenarios.

02

Energy-based policies better handle discontinuous and multimodal functions.

03

Results demonstrate improved learning efficiency and accuracy.

Abstract

Behavioral cloning, or more broadly, learning from demonstrations (LfD) is a priomising direction for robot policy learning in complex scenarios. Albeit being straightforward to implement and data-efficient, behavioral cloning has its own drawbacks, limiting its efficacy in real robot setups. In this work, we take one step towards improving learning from demonstration algorithms by leveraging implicit energy-based policy models. Results suggest that in selected complex robot policy learning scenarios, treating supervised policy learning with an implicit model generally performs better, on average, than commonly used neural network-based explicit models, especially in the cases of approximating potentially discontinuous and multimodal functions.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Algorithms and Data Compression · Parallel Computing and Optimization Techniques