Efficiently Guiding Imitation Learning Agents with Human Gaze

Akanksha Saran; Ruohan Zhang; Elaine Schaertl Short; Scott Niekum

arXiv:2002.12500·cs.LG·April 23, 2021·5 cites

Efficiently Guiding Imitation Learning Agents with Human Gaze

Akanksha Saran, Ruohan Zhang, Elaine Schaertl Short, Scott Niekum

PDF

Open Access

TL;DR

This paper introduces a novel gaze-guided auxiliary loss to improve imitation learning agents, significantly boosting performance across multiple algorithms and Atari games without additional test-time gaze data.

Contribution

It proposes a computationally efficient gaze-based auxiliary loss that enhances imitation learning without extra parameters or test-time gaze requirements.

Findings

01

Performance improved by up to 390% across Atari games

02

Outperforms prior gaze-assisted imitation learning methods

03

Reduces causal confusion in imitation learning

Abstract

Human gaze is known to be an intention-revealing signal in human demonstrations of tasks. In this work, we use gaze cues from human demonstrators to enhance the performance of agents trained via three popular imitation learning methods -- behavioral cloning (BC), behavioral cloning from observation (BCO), and Trajectory-ranked Reward EXtrapolation (T-REX). Based on similarities between the attention of reinforcement learning agents and human gaze, we propose a novel approach for utilizing gaze data in a computationally efficient manner, as part of an auxiliary loss function, which guides a network to have higher activations in image regions where the human's gaze fixated. This work is a step towards augmenting any existing convolutional imitation learning agent's training with auxiliary gaze data. Our auxiliary coverage-based gaze loss (CGL) guides learning toward a better reward…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Reinforcement Learning in Robotics

MethodsDropout