Neuroevolution-Based Inverse Reinforcement Learning
Karan K. Budhraja, Tim Oates

TL;DR
This paper introduces a neuroevolution-based method for inverse reinforcement learning that uses neural networks to learn complex, non-linear feature combinations from demonstrations, improving performance in certain stochastic environments.
Contribution
It combines neuroevolution with feature-based inverse reinforcement learning, enabling non-linear state evaluations and extending Bayesian non-parametric feature construction techniques.
Findings
Neural networks improve the fit to observed demonstrations.
The approach is effective for linearly solvable non-deterministic MDPs.
A performance hierarchy among algorithms is established.
Abstract
The problem of Learning from Demonstration is targeted at learning to perform tasks based on observed examples. One approach to Learning from Demonstration is Inverse Reinforcement Learning, in which actions are observed to infer rewards. This work combines a feature based state evaluation approach to Inverse Reinforcement Learning with neuroevolution, a paradigm for modifying neural networks based on their performance on a given task. Neural networks are used to learn from a demonstrated expert policy and are evolved to generate a policy similar to the demonstration. The algorithm is discussed and evaluated against competitive feature-based Inverse Reinforcement Learning approaches. At the cost of execution time, neural networks allow for non-linear combinations of features in state evaluations. These valuations may correspond to state value or state reward. This results in better…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
