Dynamics-Aware Quality-Diversity for Efficient Learning of Skill Repertoires
Bryan Lim, Luca Grillotti, Lorenzo Bernasconi, Antoine Cully

TL;DR
This paper introduces DA-QD, a dynamics-aware framework that significantly improves the sample efficiency of quality-diversity algorithms for robotic skill learning, enabling zero-shot generalization and real-world adaptation.
Contribution
The paper presents a novel dynamics-aware approach that enhances QD algorithms with learned dynamics models for more efficient and continual skill repertoire discovery.
Findings
DA-QD is 20 times more sample efficient than existing methods.
Enables zero-shot learning of new skill repertoires.
Effective for long horizon navigation and damage adaptation.
Abstract
Quality-Diversity (QD) algorithms are powerful exploration algorithms that allow robots to discover large repertoires of diverse and high-performing skills. However, QD algorithms are sample inefficient and require millions of evaluations. In this paper, we propose Dynamics-Aware Quality-Diversity (DA-QD), a framework to improve the sample efficiency of QD algorithms through the use of dynamics models. We also show how DA-QD can then be used for continual acquisition of new skill repertoires. To do so, we incrementally train a deep dynamics model from experience obtained when performing skill discovery using QD. We can then perform QD exploration in imagination with an imagined skill repertoire. We evaluate our approach on three robotic experiments. First, our experiments show DA-QD is 20 times more sample efficient than existing QD approaches for skill discovery. Second, we demonstrate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Semantic Web and Ontologies · Distributed and Parallel Computing Systems
