Visual Adversarial Imitation Learning using Variational Models
Rafael Rafailov, Tianhe Yu, Aravind Rajeswaran, Chelsea Finn

TL;DR
This paper introduces V-MAIL, a variational model-based adversarial imitation learning algorithm that efficiently learns visuomotor policies from visual demonstrations, improving stability and performance over prior methods.
Contribution
The paper presents a novel variational model-based adversarial imitation learning approach that enhances stability, sample efficiency, and transferability in visual imitation learning tasks.
Findings
V-MAIL learns successful visuomotor policies efficiently.
It demonstrates improved stability over previous methods.
It enables transfer learning of new tasks without additional environment interactions.
Abstract
Reward function specification, which requires considerable human effort and iteration, remains a major impediment for learning behaviors through deep reinforcement learning. In contrast, providing visual demonstrations of desired behaviors often presents an easier and more natural way to teach agents. We consider a setting where an agent is provided a fixed dataset of visual demonstrations illustrating how to perform a task, and must learn to solve the task using the provided demonstrations and unsupervised environment interactions. This setting presents a number of challenges including representation learning for visual observations, sample complexity due to high dimensional spaces, and learning instability due to the lack of a fixed reward or learning signal. Towards addressing these challenges, we develop a variational model-based adversarial imitation learning (V-MAIL) algorithm.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsReinforcement Learning in Robotics · Model Reduction and Neural Networks · Advanced Vision and Imaging
