Goal-Aware Cross-Entropy for Multi-Target Reinforcement Learning

Kibeom Kim; Min Whoo Lee; Yoonsung Kim; Je-Hwan Ryu; Minsu Lee,; Byoung-Tak Zhang

arXiv:2110.12985·cs.LG·October 27, 2021·1 cites

Goal-Aware Cross-Entropy for Multi-Target Reinforcement Learning

Kibeom Kim, Min Whoo Lee, Yoonsung Kim, Je-Hwan Ryu, Minsu Lee,, Byoung-Tak Zhang

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a goal-aware cross-entropy loss and goal-discriminative attention networks to improve multi-target reinforcement learning by enhancing goal understanding, leading to better success rates and generalization in complex tasks.

Contribution

The paper proposes GACE loss and GDAN, novel methods that enable agents to better discriminate and focus on goals in multi-target environments using self-supervised learning.

Findings

01

GDAN outperforms state-of-the-art in success ratio

02

Improves sample efficiency and generalization

03

Enhances goal-directed focus in agents

Abstract

Learning in a multi-target environment without prior knowledge about the targets requires a large amount of samples and makes generalization difficult. To solve this problem, it is important to be able to discriminate targets through semantic understanding. In this paper, we propose goal-aware cross-entropy (GACE) loss, that can be utilized in a self-supervised way using auto-labeled goal states alongside reinforcement learning. Based on the loss, we then devise goal-discriminative attention networks (GDAN) which utilize the goal-relevant information to focus on the given instruction. We evaluate the proposed methods on visual navigation and robot arm manipulation tasks with multi-target environments and show that GDAN outperforms the state-of-the-art methods in terms of task success ratio, sample efficiency, and generalization. Additionally, qualitative analyses demonstrate that our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

kibeomkim/gace-gdan
pytorchOfficial

Videos

Goal-Aware Cross-Entropy for Multi-Target Reinforcement Learning· slideslive

Taxonomy

TopicsVisual Attention and Saliency Detection · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning

MethodsAttentive Walk-Aggregating Graph Neural Network