Quality Diversity for Robot Learning: Limitations and Future Directions

Sumeet Batra; Bryon Tjanaka; Stefanos Nikolaidis; Gaurav Sukhatme

arXiv:2407.17515·cs.RO·July 26, 2024

Quality Diversity for Robot Learning: Limitations and Future Directions

Sumeet Batra, Bryon Tjanaka, Stefanos Nikolaidis, Gaurav Sukhatme

PDF

TL;DR

This paper critiques current Quality Diversity methods in robot learning, proposing a goal-conditioned policy approach with classical planning for better generalization and open-ended search, inspired by cognitive maps in neuroscience.

Contribution

It introduces a novel approach using a single goal-conditioned policy with classical planning, reducing complexity and enhancing generalization in QD for robot learning.

Findings

01

Single policy with classical planner achieves O(1) space complexity.

02

Approach generalizes to task variants.

03

Links QD with cognitive maps in neuroscience.

Abstract

Quality Diversity (QD) has shown great success in discovering high-performing, diverse policies for robot skill learning. While current benchmarks have led to the development of powerful QD methods, we argue that new paradigms must be developed to facilitate open-ended search and generalizability. In particular, many methods focus on learning diverse agents that each move to a different xy position in MAP-Elites-style bounded archives. Here, we show that such tasks can be accomplished with a single, goal-conditioned policy paired with a classical planner, achieving O(1) space complexity w.r.t. the number of policies and generalization to task variants. We hypothesize that this approach is successful because it extracts task-invariant structural knowledge by modeling a relational graph between adjacent cells in the archive. We motivate this view with emerging evidence from computational…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsFocus