Adversarial Imitation Learning via Random Search

MyungJae Shin; Joongheon Kim

arXiv:2008.09450·cs.LG·August 24, 2020

Adversarial Imitation Learning via Random Search

MyungJae Shin, Joongheon Kim

PDF

TL;DR

This paper introduces a simple, derivative-free imitation learning method using random search with linear policies, achieving competitive results on complex MuJoCo tasks without explicit reward signals.

Contribution

It proposes a novel imitation learning approach that employs random search and simple linear policies, simplifying the algorithm and enhancing reproducibility.

Findings

01

Achieves competitive performance on MuJoCo tasks

02

Uses derivative-free optimization with simple linear policies

03

Simplifies imitation learning algorithms

Abstract

Developing agents that can perform challenging complex tasks is the goal of reinforcement learning. The model-free reinforcement learning has been considered as a feasible solution. However, the state of the art research has been to develop increasingly complicated techniques. This increasing complexity makes the reconstruction difficult. Furthermore, the problem of reward dependency is still exists. As a result, research on imitation learning, which learns policy from a demonstration of experts, has begun to attract attention. Imitation learning directly learns policy based on data on the behavior of the experts without the explicit reward signal provided by the environment. However, imitation learning tries to optimize policies based on deep reinforcement learning such as trust region policy optimization. As a result, deep reinforcement learning based imitation learning also poses a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsRandom Search