Learning Task-relevant Representations for Generalization via   Characteristic Functions of Reward Sequence Distributions

Rui Yang; Jie Wang; Zijie Geng; Mingxuan Ye; Shuiwang Ji; Bin Li; Feng; Wu

arXiv:2205.10218·cs.LG·July 1, 2022

Learning Task-relevant Representations for Generalization via Characteristic Functions of Reward Sequence Distributions

Rui Yang, Jie Wang, Zijie Geng, Mingxuan Ye, Shuiwang Ji, Bin Li, Feng, Wu

PDF

1 Repo

TL;DR

This paper introduces CRESP, a method that enhances generalization in visual reinforcement learning by learning reward sequence distributions through characteristic functions, making representations invariant to visual distractions.

Contribution

CRESP is a novel approach that predicts characteristic functions of reward sequence distributions to learn task-relevant, distraction-invariant representations in visual RL.

Findings

01

CRESP significantly improves generalization performance on unseen environments.

02

CRESP outperforms several state-of-the-art methods on DeepMind Control tasks.

03

The method effectively captures task-relevant information despite visual distractions.

Abstract

Generalization across different environments with the same tasks is critical for successful applications of visual reinforcement learning (RL) in real scenarios. However, visual distractions -- which are common in real scenes -- from high-dimensional observations can be hurtful to the learned representations in visual RL, thus degrading the performance of generalization. To tackle this problem, we propose a novel approach, namely Characteristic Reward Sequence Prediction (CRESP), to extract the task-relevant information by learning reward sequence distributions (RSDs), as the reward signals are task-relevant in RL and invariant to visual distractions. Specifically, to effectively capture the task-relevant information via RSDs, CRESP introduces an auxiliary task -- that is, predicting the characteristic functions of RSDs -- to learn task-relevant representations, because we can well…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

miralab-ustc/rl-cresp
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.