TL;DR
This paper investigates how natural language advice can enhance reinforcement learning by replacing traditional human feedback with language-based guidance, improving policy shaping in complex environments.
Contribution
It introduces a novel approach that integrates natural language advice into reinforcement learning, replacing human feedback policies with language-generated guidance for better decision-making.
Findings
Natural language advice improves policy learning.
Language-guided reinforcement learning outperforms traditional methods.
The model effectively integrates advice to assist decision-making.
Abstract
Interactive reinforcement learning agents use human feedback or instruction to help them learn in complex environments. Often, this feedback comes in the form of a discrete signal that is either positive or negative. While informative, this information can be difficult to generalize on its own. In this work, we explore how natural language advice can be used to provide a richer feedback signal to a reinforcement learning agent by extending policy shaping, a well-known Interactive reinforcement learning technique. Usually policy shaping employs a human feedback policy to help an agent to learn more about how to achieve its goal. In our case, we replace this human feedback policy with policy generated based on natural language advice. We aim to inspect if the generated natural language reasoning provides support to a deep reinforcement learning agent to decide its actions successfully in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
