Learning Temporal Strategic Relationships using Generative Adversarial Imitation Learning
Tharindu Fernando, Simon Denman, Sridha Sridharan, Clinton Fookes

TL;DR
This paper introduces a framework that uses generative adversarial imitation learning with external memory modules to learn complex, multi-step strategies in human decision making, improving long-term planning and policy learning.
Contribution
It proposes a novel memory-augmented imitation learning framework that captures subtask dependencies and task-level strategies, enhancing learning from complex demonstrations.
Findings
Successfully learned expert policies in autonomous highway driving
Outperformed state-of-the-art imitation learning methods
Effectively modeled subtask relationships and task-level strategies
Abstract
This paper presents a novel framework for automatic learning of complex strategies in human decision making. The task that we are interested in is to better facilitate long term planning for complex, multi-step events. We observe temporal relationships at the subtask level of expert demonstrations, and determine the different strategies employed in order to successfully complete a task. To capture the relationship between the subtasks and the overall goal, we utilise two external memory modules, one for capturing dependencies within a single expert demonstration, such as the sequential relationship among different sub tasks, and a global memory module for modelling task level characteristics such as best practice employed by different humans based on their domain expertise. Furthermore, we demonstrate how the hidden state representation of the memory can be used as a reward signal to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · Artificial Intelligence in Games
