Deep TAMER: Interactive Agent Shaping in High-Dimensional State Spaces

Garrett Warnell; Nicholas Waytowich; Vernon Lawhern; Peter Stone

arXiv:1709.10163·cs.AI·January 23, 2018

Deep TAMER: Interactive Agent Shaping in High-Dimensional State Spaces

Garrett Warnell, Nicholas Waytowich, Vernon Lawhern, Peter Stone

PDF

2 Repos

TL;DR

Deep TAMER integrates deep learning with human-in-the-loop reinforcement learning, enabling rapid training of agents in high-dimensional spaces using real-time feedback, demonstrated by outperforming humans in Atari Bowling with minimal feedback.

Contribution

It introduces Deep TAMER, a novel framework combining deep neural networks with human feedback for efficient learning in complex, high-dimensional environments.

Findings

01

Deep TAMER successfully trained an agent to outperform humans in Atari Bowling.

02

The framework requires only 15 minutes of human feedback for effective learning.

03

Deep TAMER outperforms traditional reinforcement learning methods in complex tasks.

Abstract

While recent advances in deep reinforcement learning have allowed autonomous learning agents to succeed at a variety of complex tasks, existing algorithms generally require a lot of training data. One way to increase the speed at which agents are able to learn to perform tasks is by leveraging the input of human trainers. Although such input can take many forms, real-time, scalar-valued feedback is especially useful in situations where it proves difficult or impossible for humans to provide expert demonstrations. Previous approaches have shown the usefulness of human input provided in this fashion (e.g., the TAMER framework), but they have thus far not considered high-dimensional state spaces or employed the use of deep learning. In this paper, we do both: we propose Deep TAMER, an extension of the TAMER framework that leverages the representational power of deep neural networks in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings