Injective State-Image Mapping facilitates Visual Adversarial Imitation   Learning

Subhajit Chaudhury; Daiki Kimura; Asim Munawar; Ryuki Tachibana

arXiv:1810.01108·cs.LG·October 28, 2019

Injective State-Image Mapping facilitates Visual Adversarial Imitation Learning

Subhajit Chaudhury, Daiki Kimura, Asim Munawar, Ryuki Tachibana

PDF

Open Access

TL;DR

This paper introduces a deep learning framework that enables imitation learning directly from raw videos by leveraging an injective mapping between joint states and video streams, reducing reliance on hand-crafted rewards.

Contribution

It demonstrates that adversarial imitation learning on raw videos is equivalent to learning from state trajectories under an injective mapping, enabling effective policy learning without explicit reward functions.

Findings

01

Achieves similar performance to state-of-the-art methods using raw videos

02

Outperforms existing hand-crafted video imitation techniques

03

Can learn from YouTube videos with comparable results to reward-based methods

Abstract

The growing use of virtual autonomous agents in applications like games and entertainment demands better control policies for natural-looking movements and actions. Unlike the conventional approach of hard-coding motion routines, we propose a deep learning method for obtaining control policies by directly mimicking raw video demonstrations. Previous methods in this domain rely on extracting low-dimensional features from expert videos followed by a separate hand-crafted reward estimation step. We propose an imitation learning framework that reduces the dependence on hand-engineered reward functions by jointly learning the feature extraction and reward estimation steps using Generative Adversarial Networks (GANs). Our main contribution in this paper is to show that under injective mapping between low-level joint state (angles and velocities) trajectories and corresponding raw video…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Robot Manipulation and Learning · Reinforcement Learning in Robotics