Influencing Reinforcement Learning through Natural Language Guidance

Tasmia Tasrin; Md Sultan Al Nahian; Habarakadage Perera; Brent; Harrison

arXiv:2104.01506·cs.AI·April 13, 2021

Influencing Reinforcement Learning through Natural Language Guidance

Tasmia Tasrin, Md Sultan Al Nahian, Habarakadage Perera, Brent, Harrison

PDF

1 Repo

TL;DR

This paper investigates how natural language advice can enhance reinforcement learning by replacing traditional human feedback with language-based guidance, improving policy shaping in complex environments.

Contribution

It introduces a novel approach that integrates natural language advice into reinforcement learning, replacing human feedback policies with language-generated guidance for better decision-making.

Findings

01

Natural language advice improves policy learning.

02

Language-guided reinforcement learning outperforms traditional methods.

03

The model effectively integrates advice to assist decision-making.

Abstract

Interactive reinforcement learning agents use human feedback or instruction to help them learn in complex environments. Often, this feedback comes in the form of a discrete signal that is either positive or negative. While informative, this information can be difficult to generalize on its own. In this work, we explore how natural language advice can be used to provide a richer feedback signal to a reinforcement learning agent by extending policy shaping, a well-known Interactive reinforcement learning technique. Usually policy shaping employs a human feedback policy to help an agent to learn more about how to achieve its goal. In our case, we replace this human feedback policy with policy generated based on natural language advice. We aim to inspect if the generated natural language reasoning provides support to a deep reinforcement learning agent to decide its actions successfully in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

sultanalnahian/Reverse-Frogger-A3PS
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.