Discovering Unsupervised Behaviours from Full-State Trajectories

Luca Grillotti; Antoine Cully

arXiv:2211.15451·cs.LG·November 29, 2022·1 cites

Discovering Unsupervised Behaviours from Full-State Trajectories

Luca Grillotti, Antoine Cully

PDF

Open Access

TL;DR

This paper introduces an autonomous Quality-Diversity algorithm that discovers behavioral characterizations from full-state trajectories, enabling robots to autonomously identify diverse skills without prior task knowledge.

Contribution

The work presents a novel method for autonomous behavioral discovery in robots, removing the need for hand-coded descriptors in Quality-Diversity algorithms.

Findings

01

The algorithm autonomously finds diverse movement policies.

02

It discovers varied ways for robots to utilize their legs.

03

It identifies multiple behaviors including navigation and half-rolls.

Abstract

Improving open-ended learning capabilities is a promising approach to enable robots to face the unbounded complexity of the real-world. Among existing methods, the ability of Quality-Diversity algorithms to generate large collections of diverse and high-performing skills is instrumental in this context. However, most of those algorithms rely on a hand-coded behavioural descriptor to characterise the diversity, hence requiring prior knowledge about the considered tasks. In this work, we propose an additional analysis of Autonomous Robots Realising their Abilities; a Quality-Diversity algorithm that autonomously finds behavioural characterisations. We evaluate this approach on a simulated robotic environment, where the robot has to autonomously discover its abilities from its full-state trajectories. All algorithms were applied to three tasks: navigation, moving forward with a high…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Machine Learning and Algorithms · Optimization and Search Problems