TL;DR
This paper introduces a hierarchical approach to quality-diversity algorithms, enabling robots to learn complex skills efficiently and adapt to damages with fewer actions and failures, demonstrated on a hexapod robot in maze navigation tasks.
Contribution
The paper proposes the Hierarchical Trial and Error algorithm, which uses hierarchical behavioral repertoires to improve learning complexity and adaptability in robotic damage recovery.
Findings
Solves maze navigation with 20% fewer actions in challenging scenarios.
Reduces complete failures by 57%.
Enables learning of more complex behaviors efficiently.
Abstract
Adaptation capabilities, like damage recovery, are crucial for the deployment of robots in complex environments. Several works have demonstrated that using repertoires of pre-trained skills can enable robots to adapt to unforeseen mechanical damages in a few minutes. These adaptation capabilities are directly linked to the behavioural diversity in the repertoire. The more alternatives the robot has to execute a skill, the better are the chances that it can adapt to a new situation. However, solving complex tasks, like maze navigation, usually requires multiple different skills. Finding a large behavioural diversity for these multiple skills often leads to an intractable exponential growth of the number of required solutions. In this paper, we introduce the Hierarchical Trial and Error algorithm, which uses a hierarchical behavioural repertoire to learn diverse skills and leverages them…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
