Temporal Video-Language Alignment Network for Reward Shaping in   Reinforcement Learning

Ziyuan Cao; Reshma Anugundanahalli Ramachandra; Kelin Yu

arXiv:2302.03954·cs.RO·February 9, 2023·1 cites

Temporal Video-Language Alignment Network for Reward Shaping in Reinforcement Learning

Ziyuan Cao, Reshma Anugundanahalli Ramachandra, Kelin Yu

PDF

Open Access

TL;DR

This paper introduces a natural language-based reward shaping method for reinforcement learning in complex environments, improving task completion rates by mapping trajectories to language instructions and generating intermediate rewards.

Contribution

It extends the LEARN framework to map trajectories to natural language, enabling more effective reward shaping in RL for Atari games.

Findings

01

Ext-LEARN outperforms previous reward shaping methods.

02

Improves task success rates in Montezuma's Revenge.

03

Effective integration of language instructions enhances RL learning.

Abstract

Designing appropriate reward functions for Reinforcement Learning (RL) approaches has been a significant problem, especially for complex environments such as Atari games. Utilizing natural language instructions to provide intermediate rewards to RL agents in a process known as reward shaping can help the agent in reaching the goal state faster. In this work, we propose a natural language-based reward shaping approach that maps trajectories from the Montezuma's Revenge game environment to corresponding natural language instructions using an extension of the LanguagE-Action Reward Network (LEARN) framework. These trajectory-language mappings are further used to generate intermediate rewards which are integrated into reward functions that can be utilized to learn an optimal policy for any standard RL algorithms. For a set of 15 tasks from Atari's Montezuma's Revenge game, the Ext-LEARN…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Artificial Intelligence in Games