Offline Diversity Maximization Under Imitation Constraints

Marin Vlastelica; Jin Cheng; Georg Martius; Pavel Kolev

arXiv:2307.11373·cs.LG·June 24, 2024

Open Access

TL;DR

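This paper introduces an offline algorithm for unsupervised skill discovery that maximizes skill diversity while requiring each skill to imitate state-only expert demonstrations to a certain degree. It targets two weaknesses of current methods: their reliance on significant online interaction and their failure to use available task-agnostic data.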

Contribution

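The paper connects Fenchel duality, reinforcement learning, and unsupervised skill discovery to maximize a mutual information objective subject to KL-divergence state occupancy constraints, yielding an offline skill discovery method with imitation constraints.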

Findings

01. Effective on D4RL benchmark datasets.

02. Successful transfer from simulation to a real robot.

03. Balances diversity and imitation in the offline setting.

Abstract

There has been significant recent progress in the area of unsupervised skill discovery, utilizing various information-theoretic objectives as measures of diversity. Despite these advances, challenges remain: current methods require significant online interaction, fail to leverage vast amounts of available task-agnostic data and typically lack a quantitative measure of skill utility. We address these challenges by proposing a principled offline algorithm for unsupervised skill discovery that, in addition to maximizing diversity, ensures that each learned skill imitates state-only expert demonstrations to a certain degree. Our main analytical contribution is to connect Fenchel duality, reinforcement learning, and unsupervised skill discovery to maximize a mutual information objective subject to KL-divergence state occupancy constraints. Furthermore, we demonstrate the effectiveness of our…
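
The objective named in the abstract can be read as: maximize I(S; Z) subject to KL(d_z || d_E) <= epsilon for every skill z, where d_z is the state occupancy of skill z and d_E is the expert's state occupancy. The notation (d_z, d_E, epsilon) and the toy numbers below are assumptions made for illustration and are not taken from the paper. The sketch only evaluates the two quantities on a three-state example; it is not the paper's algorithm and does not involve Fenchel duality or any learning.

```python
import math


def kl(p, q):
    """KL(p || q) in nats for discrete distributions given as equal-length lists.

    Assumes q[i] > 0 wherever p[i] > 0 (otherwise the divergence is infinite).
    """
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def mutual_information(skill_occupancies, skill_prior):
    """I(S; Z) = sum_z p(z) * KL(d_z || d_mix), with d_mix = sum_z p(z) * d_z."""
    n_states = len(skill_occupancies[0])
    mix = [
        sum(pz * d[s] for pz, d in zip(skill_prior, skill_occupancies))
        for s in range(n_states)
    ]
    return sum(pz * kl(d, mix) for pz, d in zip(skill_prior, skill_occupancies))


def satisfies_imitation_constraints(skill_occupancies, expert_occupancy, epsilon):
    """True if every skill's state occupancy is within epsilon (in KL) of the expert's."""
    return all(kl(d, expert_occupancy) <= epsilon for d in skill_occupancies)


# Hypothetical toy data: three states, two skills, uniform skill prior.
expert = [0.5, 0.3, 0.2]
skills = [[0.6, 0.3, 0.1], [0.4, 0.3, 0.3]]
prior = [0.5, 0.5]

diversity = mutual_information(skills, prior)  # roughly 0.036 nats
feasible = satisfies_imitation_constraints(skills, expert, epsilon=0.05)
print(f"I(S; Z) = {diversity:.4f} nats, constraints satisfied: {feasible}")
```

In this toy setting, shrinking epsilon forces the skills toward the expert's occupancy, which in turn pushes I(S; Z) toward zero; that tension is the trade-off between diversity and imitation that the abstract describes.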

Taxonomy

Topics: Reinforcement Learning in Robotics · Robot Manipulation and Learning · Multimodal Machine Learning Applications