Adversarial Imitation Learning via Random Search
MyungJae Shin, Joongheon Kim

TL;DR
This paper introduces a simple, derivative-free imitation learning method using random search with linear policies, achieving competitive results on complex MuJoCo tasks without explicit reward signals.
Contribution
It proposes a novel imitation learning approach that employs random search and simple linear policies, simplifying the algorithm and enhancing reproducibility.
Findings
Achieves competitive performance on MuJoCo tasks
Uses derivative-free optimization with simple linear policies
Simplifies imitation learning algorithms
Abstract
Developing agents that can perform challenging complex tasks is the goal of reinforcement learning. The model-free reinforcement learning has been considered as a feasible solution. However, the state of the art research has been to develop increasingly complicated techniques. This increasing complexity makes the reconstruction difficult. Furthermore, the problem of reward dependency is still exists. As a result, research on imitation learning, which learns policy from a demonstration of experts, has begun to attract attention. Imitation learning directly learns policy based on data on the behavior of the experts without the explicit reward signal provided by the environment. However, imitation learning tries to optimize policies based on deep reinforcement learning such as trust region policy optimization. As a result, deep reinforcement learning based imitation learning also poses a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsRandom Search
