Model-Based Quality-Diversity Search for Efficient Robot Learning
Leon Keller, Daniel Tanneberg, Svenja Stark, Jan Peters

TL;DR
This paper introduces a model-based approach to improve the efficiency and effectiveness of quality-diversity search algorithms for robot learning by integrating a neural network for behavior prediction.
Contribution
It presents the Model-based Quality-Diversity (M-QD) algorithm that combines neural network predictions with QD to enhance sample-efficiency and skill generalization in robot learning.
Findings
Enhanced sample-efficiency in QD algorithms.
Improved skill adaptation and generalization.
Better performance in open-ended object manipulation tasks.
Abstract
Despite recent progress in robot learning, it still remains a challenge to program a robot to deal with open-ended object manipulation tasks. One approach that was recently used to autonomously generate a repertoire of diverse skills is a novelty based Quality-Diversity~(QD) algorithm. However, as most evolutionary algorithms, QD suffers from sample-inefficiency and, thus, it is challenging to apply it in real-world scenarios. This paper tackles this problem by integrating a neural network that predicts the behavior of the perturbed parameters into a novelty based QD algorithm. In the proposed Model-based Quality-Diversity search (M-QD), the network is trained concurrently to the repertoire and is used to avoid executing unpromising actions in the novelty search process. Furthermore, it is used to adapt the skills of the final repertoire in order to generalize the skills to different…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
