Learning Temporal Strategic Relationships using Generative Adversarial   Imitation Learning

Tharindu Fernando; Simon Denman; Sridha Sridharan; Clinton Fookes

arXiv:1805.04969·cs.CV·May 15, 2018·20 cites

Learning Temporal Strategic Relationships using Generative Adversarial Imitation Learning

Tharindu Fernando, Simon Denman, Sridha Sridharan, Clinton Fookes

PDF

Open Access

TL;DR

This paper introduces a framework that uses generative adversarial imitation learning with external memory modules to learn complex, multi-step strategies in human decision making, improving long-term planning and policy learning.

Contribution

It proposes a novel memory-augmented imitation learning framework that captures subtask dependencies and task-level strategies, enhancing learning from complex demonstrations.

Findings

01

Successfully learned expert policies in autonomous highway driving

02

Outperformed state-of-the-art imitation learning methods

03

Effectively modeled subtask relationships and task-level strategies

Abstract

This paper presents a novel framework for automatic learning of complex strategies in human decision making. The task that we are interested in is to better facilitate long term planning for complex, multi-step events. We observe temporal relationships at the subtask level of expert demonstrations, and determine the different strategies employed in order to successfully complete a task. To capture the relationship between the subtasks and the overall goal, we utilise two external memory modules, one for capturing dependencies within a single expert demonstration, such as the sequential relationship among different sub tasks, and a global memory module for modelling task level characteristics such as best practice employed by different humans based on their domain expertise. Furthermore, we demonstrate how the hidden state representation of the memory can be used as a reward signal to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · Artificial Intelligence in Games