Boosting Reinforcement Learning and Planning with Demonstrations: A Survey

Tongzhou Mu; Hao Su

arXiv:2303.13489 · cs.LG · March 29, 2023 · 1 citation

TL;DR

This survey reviews how demonstrations can enhance reinforcement learning and planning by supplying expert knowledge. It covers ways to integrate demonstrations into learning-based decision making, strategies for collecting them, and a practical pipeline exemplified on the ManiSkill robot learning benchmark.

Contribution

It offers a comprehensive overview of demonstration-based methods in decision making, highlighting new approaches and practical pipelines for their generation and use.

Findings

01. Demonstrations improve learning efficiency in complex environments, where pure trial-and-error exploration can be impractical or inefficient.

02. Various methods exist for integrating demonstrations into reinforcement learning and planning.

03. A practical pipeline for generating and utilizing demonstrations is illustrated on the ManiSkill benchmark.

Abstract

Although reinforcement learning has seen tremendous success recently, this kind of trial-and-error learning can be impractical or inefficient in complex environments. The use of demonstrations, on the other hand, enables agents to benefit from expert knowledge rather than having to discover the best action to take through exploration. In this survey, we discuss the advantages of using demonstrations in sequential decision making, various ways to apply demonstrations in learning-based decision making paradigms (for example, reinforcement learning and planning in the learned models), and how to collect the demonstrations in various scenarios. Additionally, we exemplify a practical pipeline for generating and utilizing demonstrations in the recently proposed ManiSkill robot learning benchmark.
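To make the contrast with exploration concrete, the following is a minimal sketch of behavior cloning, one of the simplest ways to use demonstrations: the agent copies the expert's most frequent action in each state instead of discovering it by trial and error. It is not taken from the survey. The 1-D chain environment, the function names `clone_policy` and `rollout`, and the sample trajectories are all hypothetical and chosen only for illustration.

```python
from collections import Counter, defaultdict


def clone_policy(demonstrations):
    """Tabular behavior cloning: map each demonstrated state to the
    action the expert took most often in that state."""
    counts = defaultdict(Counter)
    for trajectory in demonstrations:
        for state, action in trajectory:
            counts[state][action] += 1
    return {state: c.most_common(1)[0][0] for state, c in counts.items()}


def rollout(policy, start=0, goal=4, max_steps=20):
    """Run a policy on a hypothetical chain of states 0..goal.

    Actions are integer moves (+1 or -1). States without a cloned
    action default to 0 (stay in place). Returns the number of steps
    needed to reach the goal, or None if it is not reached in time."""
    state = start
    for steps_taken in range(max_steps + 1):
        if state == goal:
            return steps_taken
        action = policy.get(state, 0)
        state = max(0, min(goal, state + action))
    return None


# Hypothetical expert data: lists of (state, action) pairs.
demonstrations = [
    [(0, 1), (1, 1), (2, 1), (3, 1)],                    # clean trajectory
    [(0, 1), (1, -1), (0, 1), (1, 1), (2, 1), (3, 1)],   # one mistaken step back
]

policy = clone_policy(demonstrations)
steps = rollout(policy)
```

With these two trajectories, the majority vote in state 1 overrides the single mistaken step, so the cloned policy moves right in every visited state. An agent with no demonstrations (an empty policy in this sketch) never leaves the start state, which is the gap that exploration would otherwise have to close.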

Taxonomy

Topics: Reinforcement Learning in Robotics · Data Stream Mining Techniques · Machine Learning and Data Classification