Investigating the Properties of Neural Network Representations in   Reinforcement Learning

Han Wang; Erfan Miahi; Martha White; Marlos C. Machado; Zaheer Abbas,; Raksha Kumaraswamy; Vincent Liu; Adam White

arXiv:2203.15955·cs.LG·May 8, 2023·1 cites

Investigating the Properties of Neural Network Representations in Reinforcement Learning

Han Wang, Erfan Miahi, Martha White, Marlos C. Machado, Zaheer Abbas,, Raksha Kumaraswamy, Vincent Liu, Adam White

PDF

Open Access

TL;DR

This paper empirically investigates the properties of neural network representations in deep reinforcement learning, focusing on how these properties support transfer learning across tasks and environments.

Contribution

It introduces a systematic methodology to analyze and correlate representational properties with transfer performance in deep reinforcement learning agents.

Findings

01

Certain representational properties correlate with better transfer performance.

02

Representation quality varies with task similarity and training schemes.

03

Methodology generalizes across different RL environments and agents.

Abstract

In this paper we investigate the properties of representations learned by deep reinforcement learning systems. Much of the early work on representations for reinforcement learning focused on designing fixed-basis architectures to achieve properties thought to be desirable, such as orthogonality and sparsity. In contrast, the idea behind deep reinforcement learning methods is that the agent designer should not encode representational properties, but rather that the data stream should determine the properties of the representation -- good representations emerge under appropriate training schemes. In this paper we bring these two perspectives together, empirically investigating the properties of representations that support transfer in reinforcement learning. We introduce and measure six representational properties over more than 25 thousand agent-task settings. We consider Deep Q-learning…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Advanced Bandit Algorithms Research · Adversarial Robustness in Machine Learning

MethodsDense Connections · Q-Learning · Convolution · Deep Q-Network