TL;DR
This paper introduces a novel adversarial skill learning framework that derives transferable, task-agnostic skill embeddings from unlabeled multi-view videos, enabling robots to learn and reuse skills across diverse tasks without explicit rewards.
Contribution
The paper proposes an unsupervised, adversarial approach to learn a general skill embedding space from unlabeled videos, improving skill reuse and transferability in reinforcement learning.
Findings
Effective skill transfer demonstrated in both simulation and real-world data.
Enables training of control policies for new tasks through skill interpolation.
Outperforms baseline methods in learning transferable skills from unlabeled videos.
Abstract
Key challenges for the deployment of reinforcement learning (RL) agents in the real world are the discovery, representation and reuse of skills in the absence of a reward function. To this end, we propose a novel approach to learn a task-agnostic skill embedding space from unlabeled multi-view videos. Our method learns a general skill embedding independently from the task context by using an adversarial loss. We combine a metric learning loss, which utilizes temporal video coherence to learn a state representation, with an entropy regularized adversarial skill-transfer loss. The metric learning loss learns a disentangled representation by attracting simultaneous viewpoints of the same observations and repelling visually similar frames from temporal neighbors. The adversarial skill-transfer loss enhances re-usability of learned skill embeddings over multiple task domains. We show that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
